Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 35
Filtrar
Más filtros

Bases de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Nucleic Acids Res ; 47(2): e9, 2019 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-30357413

RESUMEN

We present a new method, CIDER-Seq (Circular DNA Enrichment sequencing) for the unbiased enrichment and long-read sequencing of viral-sized circular DNA molecules. We used CIDER-Seq to produce single-read full-length virus genomes for the first time. CIDER-Seq combines PCR-free virus enrichment with Single Molecule Real Time sequencing and a new sequence de-concatenation algorithm. We apply our technique to produce >1200 full-length, highly accurate geminivirus genomes from RNAi-transgenic and control plants in a field trial in Kenya. Using CIDER-Seq we can demonstrate for the first time that the expression of antiviral double-stranded RNA (dsRNA) in transgenic plants causes a consistent shift in virus populations towards species sharing low homology to the transgene derived dsRNA. Our method and its application in an economically important crop plant opens new possibilities in periodic virus sequence surveillance and accurate profiling of diverse circular DNA elements.


Asunto(s)
ADN Circular/química , ADN Viral/química , Geminiviridae/genética , Genoma Viral , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Plantas Modificadas Genéticamente/virología , Análisis de Secuencia de ADN/métodos , Algoritmos , Plantas Modificadas Genéticamente/genética , Interferencia de ARN
2.
Genome Res ; 27(12): 2083-2095, 2017 12.
Artículo en Inglés | MEDLINE | ID: mdl-29141959

RESUMEN

Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae, Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote.


Asunto(s)
Proteínas Bacterianas/genética , Bartonella henselae/genética , Bradyrhizobium/genética , Escherichia coli/genética , Genoma Bacteriano , Proteogenómica , Bases de Datos de Proteínas , Anotación de Secuencia Molecular , Sistemas de Lectura Abierta , Programas Informáticos
3.
Nucleic Acids Res ; 46(17): 8953-8965, 2018 09 28.
Artículo en Inglés | MEDLINE | ID: mdl-30137508

RESUMEN

Generating a complete, de novo genome assembly for prokaryotes is often considered a solved problem. However, we here show that Pseudomonas koreensis P19E3 harbors multiple, near identical repeat pairs up to 70 kilobase pairs in length, which contained several genes that may confer fitness advantages to the strain. Its complex genome, which also included a variable shufflon region, could not be de novo assembled with long reads produced by Pacific Biosciences' technology, but required very long reads from Oxford Nanopore Technologies. Importantly, a repeat analysis, whose results we release for over 9600 prokaryotes, indicated that very complex bacterial genomes represent a general phenomenon beyond Pseudomonas. Roughly 10% of 9331 complete bacterial and a handful of 293 complete archaeal genomes represented this 'dark matter' for de novo genome assembly of prokaryotes. Several of these 'dark matter' genome assemblies contained repeats far beyond the resolution of the sequencing technology employed and likely contain errors, other genomes were closed employing labor-intense steps like cosmid libraries, primer walking or optical mapping. Using very long sequencing reads in combination with assembly algorithms capable of resolving long, near identical repeats will bring most prokaryotic genomes within reach of fast and complete de novo genome assembly.


Asunto(s)
Algoritmos , Mapeo Cromosómico/métodos , ADN Bacteriano/química , Genoma Bacteriano , Repeticiones de Microsatélite , Pseudomonas/genética , ADN Bacteriano/genética , ADN Bacteriano/metabolismo , Ontología de Genes , Aptitud Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Anotación de Secuencia Molecular , Origanum/microbiología , Filogenia , Hojas de la Planta/microbiología , Pseudomonas/clasificación , Pseudomonas/aislamiento & purificación , Pseudomonas/metabolismo , Pseudomonas aeruginosa/clasificación , Pseudomonas aeruginosa/genética , Pseudomonas aeruginosa/aislamiento & purificación , Pseudomonas aeruginosa/metabolismo , Pseudomonas putida/clasificación , Pseudomonas putida/genética , Pseudomonas putida/aislamiento & purificación , Pseudomonas putida/metabolismo
4.
BMC Biol ; 17(1): 75, 2019 09 18.
Artículo en Inglés | MEDLINE | ID: mdl-31533702

RESUMEN

BACKGROUND: Cassava is an important food crop in tropical and sub-tropical regions worldwide. In Africa, cassava production is widely affected by cassava mosaic disease (CMD), which is caused by the African cassava mosaic geminivirus that is transmitted by whiteflies. Cassava breeders often use a single locus, CMD2, for introducing CMD resistance into susceptible cultivars. The CMD2 locus has been genetically mapped to a 10-Mbp region, but its organization and genes as well as their functions are unknown. RESULTS: We report haplotype-resolved de novo assemblies and annotations of the genomes for the African cassava cultivar TME (tropical Manihot esculenta), which is the origin of CMD2, and the CMD-susceptible cultivar 60444. The assemblies provide phased haplotype information for over 80% of the genomes. Haplotype comparison identified novel features previously hidden in collapsed and fragmented cassava genomes, including thousands of allelic variants, inter-haplotype diversity in coding regions, and patterns of diversification through allele-specific expression. Reconstruction of the CMD2 locus revealed a highly complex region with nearly identical gene sets but limited microsynteny between the two cultivars. CONCLUSIONS: The genome maps of the CMD2 locus in both 60444 and TME3, together with the newly annotated genes, will help the identification of the causal genetic basis of CMD2 resistance to geminiviruses. Our de novo cassava genome assemblies will also facilitate genetic mapping approaches to narrow the large CMD2 region to a few candidate genes for better informed strategies to develop robust geminivirus resistance in susceptible cassava cultivars.


Asunto(s)
Resistencia a la Enfermedad/genética , Haplotipos/genética , Manihot/genética , Enfermedades de las Plantas/genética , Mapeo Cromosómico/métodos , Susceptibilidad a Enfermedades , Geminiviridae , Predisposición Genética a la Enfermedad , Anotación de Secuencia Molecular
5.
PLoS Genet ; 12(12): e1006499, 2016 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-27997543

RESUMEN

Heritable DNA methylation imprints are ubiquitous and underlie genetic variability from bacteria to humans. In microbial genomes, DNA methylation has been implicated in gene transcription, DNA replication and repair, nucleoid segregation, transposition and virulence of pathogenic strains. Despite the importance of local (hypo)methylation at specific loci, how and when these patterns are established during the cell cycle remains poorly characterized. Taking advantage of the small genomes and the synchronizability of α-proteobacteria, we discovered that conserved determinants of the cell cycle transcriptional circuitry establish specific hypomethylation patterns in the cell cycle model system Caulobacter crescentus. We used genome-wide methyl-N6-adenine (m6A-) analyses by restriction-enzyme-cleavage sequencing (REC-Seq) and single-molecule real-time (SMRT) sequencing to show that MucR, a transcriptional regulator that represses virulence and cell cycle genes in S-phase but no longer in G1-phase, occludes 5'-GANTC-3' sequence motifs that are methylated by the DNA adenine methyltransferase CcrM. Constitutive expression of CcrM or heterologous methylases in at least two different α-proteobacteria homogenizes m6A patterns even when MucR is present and affects promoter activity. Environmental stress (phosphate limitation) can override and reconfigure local hypomethylation patterns imposed by the cell cycle circuitry that dictate when and where local hypomethylation is instated.


Asunto(s)
Caulobacter crescentus/genética , Ciclo Celular/genética , Metilación de ADN/genética , Transcripción Genética , División Celular/genética , Replicación del ADN/efectos de los fármacos , Replicación del ADN/genética , Regulación Bacteriana de la Expresión Génica , Genoma Microbiano , Metiltransferasas/genética , Fosfatos/metabolismo , Regiones Promotoras Genéticas , Metiltransferasa de ADN de Sitio Específico (Adenina Especifica)/genética , Inanición/genética , Inanición/metabolismo
6.
Nucleic Acids Res ; 43(11): e76, 2015 Jun 23.
Artículo en Inglés | MEDLINE | ID: mdl-25820422

RESUMEN

Whole exome sequencing (WES) is increasingly used in research and diagnostics. WES users expect coverage of the entire coding region of known genes as well as sufficient read depth for the covered regions. It is, however, unknown which recent WES platform is most suitable to meet these expectations. We present insights into the performance of the most recent standard exome enrichment platforms from Agilent, NimbleGen and Illumina applied to six different DNA samples by two sequencing vendors per platform. Our results suggest that both Agilent and NimbleGen overall perform better than Illumina and that the high enrichment performance of Agilent is stable among samples and between vendors, whereas NimbleGen is only able to achieve vendor- and sample-specific best exome coverage. Moreover, the recent Agilent platform overall captures more coding exons with sufficient read depth than NimbleGen and Illumina. Due to considerable gaps in effective exome coverage, however, the three platforms cannot capture all known coding exons alone or in combination, requiring improvement. Our data emphasize the importance of evaluation of updated platform versions and suggest that enrichment-free whole genome sequencing can overcome the limitations of WES in sufficiently covering coding exons, especially GC-rich regions, and in characterizing structural variants.


Asunto(s)
Exoma , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Alelos , Composición de Base , ADN/química , Humanos
7.
New Phytol ; 212(3): 780-791, 2016 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-27381250

RESUMEN

Community analyses of arbuscular mycorrhizal fungi (AMF) using ribosomal small subunit (SSU) or internal transcribed spacer (ITS) DNA sequences often suffer from low resolution or coverage. We developed a novel sequencing based approach for a highly resolving and specific profiling of AMF communities. We took advantage of previously established AMF-specific PCR primers that amplify a c. 1.5-kb long fragment covering parts of SSU, ITS and parts of the large ribosomal subunit (LSU), and we sequenced the resulting amplicons with single molecule real-time (SMRT) sequencing. The method was applicable to soil and root samples, detected all major AMF families and successfully discriminated closely related AMF species, which would not be discernible using SSU sequences. In inoculation tests we could trace the introduced AMF inoculum at the molecular level. One of the introduced strains almost replaced the local strain(s), revealing that AMF inoculation can have a profound impact on the native community. The methodology presented offers researchers a powerful new tool for AMF community analysis because it unifies improved specificity and enhanced resolution, whereas the drawback of medium sequencing throughput appears of lesser importance for low-diversity groups such as AMF.


Asunto(s)
Glomeromycota/fisiología , Micorrizas/fisiología , ADN de Hongos/genética , Operón/genética , ARN Ribosómico/genética , Análisis de Secuencia de ADN , Microbiología del Suelo
8.
Nucleic Acids Res ; 42(14): e115, 2014 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-24972832

RESUMEN

Next-generation sequencing (NGS) technologies enable new insights into the diversity of virus populations within their hosts. Diversity estimation is currently restricted to single-nucleotide variants or to local fragments of no more than a few hundred nucleotides defined by the length of sequence reads. To study complex heterogeneous virus populations comprehensively, novel methods are required that allow for complete reconstruction of the individual viral haplotypes. Here, we show that assembly of whole viral genomes of ∼8600 nucleotides length is feasible from mixtures of heterogeneous HIV-1 strains derived from defined combinations of cloned virus strains and from clinical samples of an HIV-1 superinfected individual. Haplotype reconstruction was achieved using optimized experimental protocols and computational methods for amplification, sequencing and assembly. We comparatively assessed the performance of the three NGS platforms 454 Life Sciences/Roche, Illumina and Pacific Biosciences for this task. Our results prove and delineate the feasibility of NGS-based full-length viral haplotype reconstruction and provide new tools for studying evolution and pathogenesis of viruses.


Asunto(s)
Variación Genética , VIH-1/genética , Haplotipos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Genoma Viral , Infecciones por VIH/virología , Humanos
9.
BMC Genomics ; 15: 16, 2014 Jan 10.
Artículo en Inglés | MEDLINE | ID: mdl-24405840

RESUMEN

BACKGROUND: High-throughput sequencing has opened up exciting possibilities in population and conservation genetics by enabling the assessment of genetic variation at genome-wide scales. One approach to reduce genome complexity, i.e. investigating only parts of the genome, is reduced-representation library (RRL) sequencing. Like similar approaches, RRL sequencing reduces ascertainment bias due to simultaneous discovery and genotyping of single-nucleotide polymorphisms (SNPs) and does not require reference genomes. Yet, generating such datasets remains challenging due to laboratory and bioinformatical issues. In the laboratory, current protocols require improvements with regards to sequencing homologous fragments to reduce the number of missing genotypes. From the bioinformatical perspective, the reliance of most studies on a single SNP caller disregards the possibility that different algorithms may produce disparate SNP datasets. RESULTS: We present an improved RRL (iRRL) protocol that maximizes the generation of homologous DNA sequences, thus achieving improved genotyping-by-sequencing efficiency. Our modifications facilitate generation of single-sample libraries, enabling individual genotype assignments instead of pooled-sample analysis. We sequenced ~1% of the orangutan genome with 41-fold median coverage in 31 wild-born individuals from two populations. SNPs and genotypes were called using three different algorithms. We obtained substantially different SNP datasets depending on the SNP caller. Genotype validations revealed that the Unified Genotyper of the Genome Analysis Toolkit and SAMtools performed significantly better than a caller from CLC Genomics Workbench (CLC). Of all conflicting genotype calls, CLC was only correct in 17% of the cases. Furthermore, conflicting genotypes between two algorithms showed a systematic bias in that one caller almost exclusively assigned heterozygotes, while the other one almost exclusively assigned homozygotes. CONCLUSIONS: Our enhanced iRRL approach greatly facilitates genotyping-by-sequencing and thus direct estimates of allele frequencies. Our direct comparison of three commonly used SNP callers emphasizes the need to question the accuracy of SNP and genotype calling, as we obtained considerably different SNP datasets depending on caller algorithms, sequencing depths and filtering criteria. These differences affected scans for signatures of natural selection, but will also exert undue influences on demographic inferences. This study presents the first effort to generate a population genomic dataset for wild-born orangutans with known population provenance.


Asunto(s)
Algoritmos , Bases de Datos Genéticas , Genoma , Genómica/métodos , Polimorfismo de Nucleótido Simple , Pongo abelii/genética , Animales , Biología Computacional , Frecuencia de los Genes , Genotipo , Heterocigoto , Secuenciación de Nucleótidos de Alto Rendimiento , Programas Informáticos
10.
Eur J Neurosci ; 38(11): 3567-79, 2013 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-24103058

RESUMEN

Adult central nervous system axons show restricted growth and regeneration properties after injury. One of the underlying mechanisms is the activation of the Nogo-A/Nogo receptor (NgR1) signaling pathway. Nogo-A knockout (KO) mice show enhanced regenerative growth in vivo, even though it is less pronounced than after acute antibody-mediated neutralization of Nogo-A. Residual inhibition may involve a compensatory component. By mRNA expression profiling and immunoblots we show increased expression of several members of the Ephrin/Eph and Semaphorin/Plexin families of axon guidance molecules, e.g. EphrinA3 and EphA4, in the intact spinal cord of adult Nogo-A KO vs. wild-type (WT) mice. EphrinA3 inhibits neurite outgrowth of EphA4-positive neurons in vitro. In addition, EphrinA3 KO myelin extracts are less growth-inhibitory than WT but more than Nogo-A KO myelin extracts. EphA4 KO cortical neurons show decreased growth inhibition on Nogo-A KO myelin as compared with WT neurons, supporting increased EphA4-mediated growth inhibition in Nogo-A KO mice. Consistently, in vivo, Nogo-A/EphA4 double KO mice show increased axonal sprouting and regeneration after spinal cord injury as compared with EphA4 KO mice. Our results reveal the upregulation of developmental axon guidance cues following constitutive Nogo-A deletion, e.g. the EphrinA3/EphA4 ligand/receptor pair, and support their role in restricting neurite outgrowth in the absence of Nogo-A.


Asunto(s)
Axones/fisiología , Corteza Cerebral/metabolismo , Ganglios Espinales/metabolismo , Proteínas de la Mielina/metabolismo , Regeneración de la Medula Espinal , Regulación hacia Arriba , Animales , Axones/metabolismo , Células Cultivadas , Corteza Cerebral/patología , Corteza Cerebral/fisiología , Efrina-A3/genética , Efrina-A3/metabolismo , Efrina-A4/genética , Efrina-A4/metabolismo , Ganglios Espinales/patología , Ganglios Espinales/fisiología , Ratones , Ratones Endogámicos C57BL , Ratones Noqueados , Proteínas de la Mielina/genética , Vaina de Mielina/genética , Vaina de Mielina/metabolismo , Proteínas Nogo , Tractos Piramidales/metabolismo , Tractos Piramidales/patología , Tractos Piramidales/fisiología , ARN Mensajero/genética , ARN Mensajero/metabolismo , Traumatismos de la Médula Espinal/metabolismo
11.
Gigascience ; 112022 03 24.
Artículo en Inglés | MEDLINE | ID: mdl-35333302

RESUMEN

BACKGROUND: Cassava (Manihot esculenta) is an important clonally propagated food crop in tropical and subtropical regions worldwide. Genetic gain by molecular breeding has been limited, partially because cassava is a highly heterozygous crop with a repetitive and difficult-to-assemble genome. FINDINGS: Here we demonstrate that Pacific Biosciences high-fidelity (HiFi) sequencing reads, in combination with the assembler hifiasm, produced genome assemblies at near complete haplotype resolution with higher continuity and accuracy compared to conventional long sequencing reads. We present 2 chromosome-scale haploid genomes phased with Hi-C technology for the diploid African cassava variety TME204. With consensus accuracy >QV46, contig N50 >18 Mb, BUSCO completeness of 99%, and 35k phased gene loci, it is the most accurate, continuous, complete, and haplotype-resolved cassava genome assembly so far. Ab initio gene prediction with RNA-seq data and Iso-Seq transcripts identified abundant novel gene loci, with enriched functionality related to chromatin organization, meristem development, and cell responses. During tissue development, differentially expressed transcripts of different haplotype origins were enriched for different functionality. In each tissue, 20-30% of transcripts showed allele-specific expression (ASE) differences. ASE bias was often tissue specific and inconsistent across different tissues. Direction-shifting was observed in <2% of the ASE transcripts. Despite high gene synteny, the HiFi genome assembly revealed extensive chromosome rearrangements and abundant intra-genomic and inter-genomic divergent sequences, with large structural variations mostly related to LTR retrotransposons. We use the reference-quality assemblies to build a cassava pan-genome and demonstrate its importance in representing the genetic diversity of cassava for downstream reference-guided omics analysis and breeding. CONCLUSIONS: The phased and annotated chromosome pairs allow a systematic view of the heterozygous diploid genome organization in cassava with improved accuracy, completeness, and haplotype resolution. They will be a valuable resource for cassava breeding and research. Our study may also provide insights into developing cost-effective and efficient strategies for resolving complex genomes with high resolution, accuracy, and continuity.


Asunto(s)
Manihot , Alelos , Cromosomas , Diploidia , Haplotipos , Manihot/genética , Fitomejoramiento , Análisis de Secuencia de ADN , Transcriptoma
12.
Plant Physiol ; 152(2): 487-99, 2010 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-20032078

RESUMEN

Transcriptome profiling has become a routine tool in biology. For Arabidopsis (Arabidopsis thaliana), the Affymetrix ATH1 expression array is most commonly used, but it lacks about one-third of all annotated genes present in the reference strain. An alternative are tiling arrays, but previous designs have not allowed the simultaneous analysis of both strands on a single array. We introduce AGRONOMICS1, a new Affymetrix Arabidopsis microarray that contains the complete paths of both genome strands, with on average one 25mer probe per 35-bp genome sequence window. In addition, the new AGRONOMICS1 array contains all perfect match probes from the original ATH1 array, allowing for seamless integration of the very large existing ATH1 knowledge base. The AGRONOMICS1 array can be used for diverse functional genomics applications such as reliable expression profiling of more than 30,000 genes, detection of alternative splicing, and chromatin immunoprecipitation coupled to microarrays (ChIP-chip). Here, we describe the design of the array and compare its performance with that of the ATH1 array. We find results from both microarrays to be of similar quality, but AGRONOMICS1 arrays yield robust expression information for many more genes, as expected. Analysis of the ATH1 probes on AGRONOMICS1 arrays produces results that closely mirror those of ATH1 arrays. Finally, the AGRONOMICS1 array is shown to be useful for ChIP-chip experiments. We show that heterochromatic H3K9me2 is strongly confined to the gene body of target genes in euchromatic chromosome regions, suggesting that spreading of heterochromatin is limited outside of pericentromeric regions.


Asunto(s)
Arabidopsis/genética , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Inmunoprecipitación de Cromatina , Biología Computacional , Sondas de ADN , Genes de Plantas , Genómica , ARN de Planta/genética , Análisis de Secuencia de ADN
13.
Gene ; 793: 145748, 2021 Aug 15.
Artículo en Inglés | MEDLINE | ID: mdl-34077775

RESUMEN

The rice root-knot nematode Meloidogyne graminicola is a major biotic stress for the rice crop under upland, rain-fed lowland and irrigated cultivation conditions. Here, we present an improved draft genome assembly of M. graminicola IARI strain using the long-read sequencing approach (PacBio Sequel platform). The assembled genome size was 36.86 Mb with 514 contigs and N50 value of 105 kb. BUSCO estimated the genome to be 88.6% complete. Meloidogyne graminicola genome contained 17.83% repeat elements and showed 14,062 protein-coding gene models, 4,974 conserved orthologous genes, 561 putative secreted proteins, 49 RNAi pathway genes, 1,853 proteins involved in pathogen-host interactions, 1,575 carbohydrate-active enzymes, and 32,138 microsatellites. Five of the carbohydrate-active enzymes were found only in M. graminicola genome and were not present in any other analysed root-knot nematode genome. Together with the previous two genome assemblies, this improved genome assembly would facilitate comparative and functional genomics for M. graminicola.


Asunto(s)
Genes de Helminto , Genoma de los Helmintos , Proteínas del Helminto/genética , Oryza/parasitología , Tylenchoidea/genética , Animales , Ontología de Genes , Tamaño del Genoma , Proteínas del Helminto/clasificación , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Repeticiones de Microsatélite , Anotación de Secuencia Molecular , Sistemas de Lectura Abierta , Filogenia , Enfermedades de las Plantas/parasitología , Tylenchoidea/clasificación
14.
Epidemics ; 37: 100480, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34488035

RESUMEN

BACKGROUND: In December 2020, the United Kingdom (UK) reported a SARS-CoV-2 Variant of Concern (VoC) which is now named B.1.1.7. Based on initial data from the UK and later data from other countries, this variant was estimated to have a transmission fitness advantage of around 40-80 % (Volz et al., 2021; Leung et al., 2021; Davies et al., 2021). AIM: This study aims to estimate the transmission fitness advantage and the effective reproductive number of B.1.1.7 through time based on data from Switzerland. METHODS: We generated whole genome sequences from 11.8 % of all confirmed SARS-CoV-2 cases in Switzerland between 14 December 2020 and 11 March 2021. Based on these data, we determine the daily frequency of the B.1.1.7 variant and quantify the variant's transmission fitness advantage on a national and a regional scale. RESULTS: We estimate B.1.1.7 had a transmission fitness advantage of 43-52 % compared to the other variants circulating in Switzerland during the study period. Further, we estimate B.1.1.7 had a reproductive number above 1 from 01 January 2021 until the end of the study period, compared to below 1 for the other variants. Specifically, we estimate the reproductive number for B.1.1.7 was 1.24 [1.07-1.41] from 01 January until 17 January 2021 and 1.18 [1.06-1.30] from 18 January until 01 March 2021 based on the whole genome sequencing data. From 10 March to 16 March 2021, once B.1.1.7 was dominant, we estimate the reproductive number was 1.14 [1.00-1.26] based on all confirmed cases. For reference, Switzerland applied more non-pharmaceutical interventions to combat SARS-CoV-2 on 18 January 2021 and lifted some measures again on 01 March 2021. CONCLUSION: The observed increase in B.1.1.7 frequency in Switzerland during the study period is as expected based on observations in the UK. In absolute numbers, B.1.1.7 increased exponentially with an estimated doubling time of around 2-3.5 weeks. To monitor the ongoing spread of B.1.1.7, our plots are available online.


Asunto(s)
COVID-19 , SARS-CoV-2 , Humanos , Suiza/epidemiología , Reino Unido
15.
Mol Cell Biol ; 39(23)2019 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-31548262

RESUMEN

The enhancer/promoter of the vitellogenin II gene (VTG) has been extensively studied as a model system of vertebrate transcriptional control. While deletion mutagenesis and in vivo footprinting identified the transcription factor (TF) binding sites governing its tissue specificity, DNase hypersensitivity and DNA methylation studies revealed the epigenetic changes accompanying its hormone-dependent activation. Moreover, upon induction with estrogen (E2), the region flanking the estrogen-responsive element (ERE) was reported to undergo active DNA demethylation. We now show that although the VTG ERE is methylated in embryonic chicken liver and in LMH/2A hepatocytes, its induction by E2 was not accompanied by extensive demethylation. In contrast, E2 failed to activate a VTG enhancer/promoter-controlled luciferase reporter gene methylated by SssI. Surprisingly, this inducibility difference could be traced not to the ERE but rather to a single CpG in an E-box (CACGTG) sequence upstream of the VTG TATA box, which is unmethylated in vivo but methylated by SssI. We demonstrate that this E-box binds the upstream stimulating factor USF1/2. Selective methylation of the CpG within this binding site with an E-box-specific DNA methyltransferase, Eco72IM, was sufficient to attenuate USF1/2 binding in vitro and abolish the hormone-induced transcription of the VTG gene in the reporter system.


Asunto(s)
Expresión Génica Ectópica/genética , Receptor alfa de Estrógeno/genética , Vitelogeninas/genética , Animales , Sitios de Unión , Línea Celular , Embrión de Pollo , Islas de CpG/genética , Metilación de ADN/genética , Proteínas de Unión al ADN/metabolismo , ADN-Citosina Metilasas/metabolismo , Expresión Génica Ectópica/efectos de los fármacos , Receptor alfa de Estrógeno/metabolismo , Estrógenos/metabolismo , Regulación de la Expresión Génica/efectos de los fármacos , Genes Reporteros , Humanos , Regiones Promotoras Genéticas/efectos de los fármacos , Regiones Promotoras Genéticas/genética , Secuencias Reguladoras de Ácidos Nucleicos , Factores de Transcripción/metabolismo , Vitelogeninas/metabolismo
16.
Nat Commun ; 10(1): 3359, 2019 07 31.
Artículo en Inglés | MEDLINE | ID: mdl-31366910

RESUMEN

A platform for highly parallel direct sequencing of native RNA strands was recently described by Oxford Nanopore Technologies, but despite initial efforts it remains crucial to further investigate the technology for quantification of complex transcriptomes. Here we undertake native RNA sequencing of polyA + RNA from two human cell lines, analysing ~5.2 million aligned native RNA reads. To enable informative comparisons, we also perform relevant ONT direct cDNA- and Illumina-sequencing. We find that while native RNA sequencing does enable some of the anticipated advantages, key unexpected aspects currently hamper its performance, most notably the quite frequent inability to obtain full-length transcripts from single reads, as well as difficulties to unambiguously infer their true transcript of origin. While characterising issues that need to be addressed when investigating more complex transcriptomes, our study highlights that with some defined improvements, native RNA sequencing could be an important addition to the mammalian transcriptomics toolbox.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ARN/métodos , Transcriptoma/genética , Secuencia de Bases/genética , Línea Celular , ADN Complementario/genética , Células HEK293 , Humanos , Poli A/genética
17.
Sci Rep ; 9(1): 16444, 2019 11 11.
Artículo en Inglés | MEDLINE | ID: mdl-31712730

RESUMEN

Pseudoalteromonas haloplanktis TAC125 is among the most commonly studied bacteria adapted to cold environments. Aside from its ecological relevance, P. haloplanktis has a potential use for biotechnological applications. Due to its importance, we decided to take advantage of next generation sequencing (Illumina) and third generation sequencing (PacBio and Oxford Nanopore) technologies to resequence its genome. The availability of a reference genome, obtained using whole genome shotgun sequencing, allowed us to study and compare the results obtained by the different technologies and draw useful conclusions for future de novo genome assembly projects. We found that assembly polishing using Illumina reads is needed to achieve a consensus accuracy over 99.9% when using Oxford Nanopore sequencing, but not in PacBio sequencing. However, the dependency of consensus accuracy on coverage is lower in Oxford Nanopore than in PacBio, suggesting that a cost-effective solution might be the use of low coverage Oxford Nanopore sequencing together with Illumina reads. Despite the differences in consensus accuracy, all sequencing technologies revealed the presence of a large plasmid, pMEGA, which was undiscovered until now. Among the most interesting features of pMEGA is the presence of a putative error-prone polymerase regulated through the SOS response. Aside from the characterization of the newly discovered plasmid, we confirmed the sequence of the small plasmid pMtBL and uncovered the presence of a potential partitioning system. Crucially, this study shows that the combination of next and third generation sequencing technologies give us an unprecedented opportunity to characterize our bacterial model organisms at a very detailed level.


Asunto(s)
Genoma Bacteriano , Genómica , Infecciones por Bacterias Gramnegativas/microbiología , Secuenciación de Nucleótidos de Alto Rendimiento , Pseudoalteromonas/genética , Organismos Acuáticos , Biología Computacional/métodos , Genómica/métodos , Anotación de Secuencia Molecular , Microbiología del Agua
18.
DNA Res ; 25(1): 39-47, 2018 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-28985356

RESUMEN

Finger millet (Eleusine coracana (L.) Gaertn) is an important crop for food security because of its tolerance to drought, which is expected to be exacerbated by global climate changes. Nevertheless, it is often classified as an orphan/underutilized crop because of the paucity of scientific attention. Among several small millets, finger millet is considered as an excellent source of essential nutrient elements, such as iron and zinc; hence, it has potential as an alternate coarse cereal. However, high-quality genome sequence data of finger millet are currently not available. One of the major problems encountered in the genome assembly of this species was its polyploidy, which hampers genome assembly compared with a diploid genome. To overcome this problem, we sequenced its genome using diverse technologies with sufficient coverage and assembled it via a novel multiple hybrid assembly workflow that combines next-generation with single-molecule sequencing, followed by whole-genome optical mapping using the Bionano Irys® system. The total number of scaffolds was 1,897 with an N50 length >2.6 Mb and detection of 96% of the universal single-copy orthologs. The majority of the homeologs were assembled separately. This indicates that the proposed workflow is applicable to the assembly of other allotetraploid genomes.

19.
20.
Hum Genet ; 122(1): 23-32, 2007 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-17492313

RESUMEN

Mutations in the FBN1 gene are the major cause of Marfan syndrome (MFS), an autosomal dominant connective tissue disorder, which displays variable manifestations in the cardiovascular, ocular, and skeletal systems. Current molecular genetic testing of FBN1 may miss mutations in the promoter region or in other noncoding sequences as well as partial or complete gene deletions and duplications. In this study, we tested for copy number variations by successively applying multiplex ligation-dependent probe amplification (MLPA) and the Affymetrix Human Mapping 500 K Array Set, which contains probes for approximately 500,000 single-nucleotide polymorphisms (SNPs) across the genome. By analyzing genomic DNA of 101 unrelated individuals with MFS or related phenotypes in whom standard genetic testing detected no mutation, we identified FBN1 deletions in two patients with MFS. Our high-resolution approach narrowed down the deletion breakpoints. Subsequent sequencing of the junctional fragments revealed the deletion sizes of 26,887 and 302,580 bp, respectively. Surprisingly, both deletions affect the putative regulatory and promoter region of the FBN1 gene, strongly indicating that they abolish transcription of the deleted allele. This expectation of complete loss of function of one allele, i.e. true haploinsufficiency, was confirmed by transcript analyses. Our findings not only emphasize the importance of screening for large genomic rearrangements in comprehensive genetic testing of FBN1 but, importantly, also extend the molecular etiology of MFS by providing hitherto unreported evidence that true haploinsufficiency is sufficient to cause MFS.


Asunto(s)
Eliminación de Gen , Pérdida de Heterocigocidad , Síndrome de Marfan/genética , Proteínas de Microfilamentos/genética , Secuencia de Bases , Rotura Cromosómica , Estudios de Cohortes , Análisis Mutacional de ADN/métodos , Fibrilina-1 , Fibrilinas , Pruebas Genéticas , Haplotipos , Humanos , Datos de Secuencia Molecular , Polimorfismo de Nucleótido Simple , ARN Mensajero/análisis
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA