Pesquisa | Biblioteca Virtual em Saúde Fiocruz

1.

A new full-length circular DNA sequencing method for viral-sized genomes reveals that RNAi transgenic plants provoke a shift in geminivirus populations in the field.

Mehta, Devang; Hirsch-Hoffmann, Matthias; Were, Mariam; Patrignani, Andrea; Zaidi, Syed Shan-E-Ali; Were, Hassan; Gruissem, Wilhelm; Vanderschuren, Hervé.

Nucleic Acids Res ; 47(2): e9, 2019 01 25.

Artigo em Inglês | MEDLINE | ID: mdl-30357413

RESUMO

We present a new method, CIDER-Seq (Circular DNA Enrichment sequencing) for the unbiased enrichment and long-read sequencing of viral-sized circular DNA molecules. We used CIDER-Seq to produce single-read full-length virus genomes for the first time. CIDER-Seq combines PCR-free virus enrichment with Single Molecule Real Time sequencing and a new sequence de-concatenation algorithm. We apply our technique to produce >1200 full-length, highly accurate geminivirus genomes from RNAi-transgenic and control plants in a field trial in Kenya. Using CIDER-Seq we can demonstrate for the first time that the expression of antiviral double-stranded RNA (dsRNA) in transgenic plants causes a consistent shift in virus populations towards species sharing low homology to the transgene derived dsRNA. Our method and its application in an economically important crop plant opens new possibilities in periodic virus sequence surveillance and accurate profiling of diverse circular DNA elements.

Assuntos

DNA Circular/química , DNA Viral/química , Geminiviridae/genética , Genoma Viral , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Plantas Geneticamente Modificadas/virologia , Análise de Sequência de DNA/métodos , Algoritmos , Plantas Geneticamente Modificadas/genética , Interferência de RNA

2.

An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

Omasits, Ulrich; Varadarajan, Adithi R; Schmid, Michael; Goetze, Sandra; Melidis, Damianos; Bourqui, Marc; Nikolayeva, Olga; Québatte, Maxime; Patrignani, Andrea; Dehio, Christoph; Frey, Juerg E; Robinson, Mark D; Wollscheid, Bernd; Ahrens, Christian H.

Genome Res ; 27(12): 2083-2095, 2017 12.

Artigo em Inglês | MEDLINE | ID: mdl-29141959

RESUMO

Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae, Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote.

Assuntos

Proteínas de Bactérias/genética , Bartonella henselae/genética , Bradyrhizobium/genética , Escherichia coli/genética , Genoma Bacteriano , Proteogenômica , Bases de Dados de Proteínas , Anotação de Sequência Molecular , Fases de Leitura Aberta , Software

3.

Pushing the limits of de novo genome assembly for complex prokaryotic genomes harboring very long, near identical repeats.

Schmid, Michael; Frei, Daniel; Patrignani, Andrea; Schlapbach, Ralph; Frey, Jürg E; Remus-Emsermann, Mitja N P; Ahrens, Christian H.

Nucleic Acids Res ; 46(17): 8953-8965, 2018 09 28.

Artigo em Inglês | MEDLINE | ID: mdl-30137508

RESUMO

Generating a complete, de novo genome assembly for prokaryotes is often considered a solved problem. However, we here show that Pseudomonas koreensis P19E3 harbors multiple, near identical repeat pairs up to 70 kilobase pairs in length, which contained several genes that may confer fitness advantages to the strain. Its complex genome, which also included a variable shufflon region, could not be de novo assembled with long reads produced by Pacific Biosciences' technology, but required very long reads from Oxford Nanopore Technologies. Importantly, a repeat analysis, whose results we release for over 9600 prokaryotes, indicated that very complex bacterial genomes represent a general phenomenon beyond Pseudomonas. Roughly 10% of 9331 complete bacterial and a handful of 293 complete archaeal genomes represented this 'dark matter' for de novo genome assembly of prokaryotes. Several of these 'dark matter' genome assemblies contained repeats far beyond the resolution of the sequencing technology employed and likely contain errors, other genomes were closed employing labor-intense steps like cosmid libraries, primer walking or optical mapping. Using very long sequencing reads in combination with assembly algorithms capable of resolving long, near identical repeats will bring most prokaryotic genomes within reach of fast and complete de novo genome assembly.

Assuntos

Algoritmos , Mapeamento Cromossômico/métodos , DNA Bacteriano/química , Genoma Bacteriano , Repetições de Microssatélites , Pseudomonas/genética , DNA Bacteriano/genética , DNA Bacteriano/metabolismo , Ontologia Genética , Aptidão Genética , Sequenciamento de Nucleotídeos em Larga Escala , Anotação de Sequência Molecular , Origanum/microbiologia , Filogenia , Folhas de Planta/microbiologia , Pseudomonas/classificação , Pseudomonas/isolamento & purificação , Pseudomonas/metabolismo , Pseudomonas aeruginosa/classificação , Pseudomonas aeruginosa/genética , Pseudomonas aeruginosa/isolamento & purificação , Pseudomonas aeruginosa/metabolismo , Pseudomonas putida/classificação , Pseudomonas putida/genética , Pseudomonas putida/isolamento & purificação , Pseudomonas putida/metabolismo

4.

Haplotype-resolved genomes of geminivirus-resistant and geminivirus-susceptible African cassava cultivars.

Kuon, Joel-E; Qi, Weihong; Schläpfer, Pascal; Hirsch-Hoffmann, Matthias; von Bieberstein, Philipp Rogalla; Patrignani, Andrea; Poveda, Lucy; Grob, Stefan; Keller, Miyako; Shimizu-Inatsugi, Rie; Grossniklaus, Ueli; Vanderschuren, Hervé; Gruissem, Wilhelm.

BMC Biol ; 17(1): 75, 2019 09 18.

Artigo em Inglês | MEDLINE | ID: mdl-31533702

RESUMO

BACKGROUND: Cassava is an important food crop in tropical and sub-tropical regions worldwide. In Africa, cassava production is widely affected by cassava mosaic disease (CMD), which is caused by the African cassava mosaic geminivirus that is transmitted by whiteflies. Cassava breeders often use a single locus, CMD2, for introducing CMD resistance into susceptible cultivars. The CMD2 locus has been genetically mapped to a 10-Mbp region, but its organization and genes as well as their functions are unknown. RESULTS: We report haplotype-resolved de novo assemblies and annotations of the genomes for the African cassava cultivar TME (tropical Manihot esculenta), which is the origin of CMD2, and the CMD-susceptible cultivar 60444. The assemblies provide phased haplotype information for over 80% of the genomes. Haplotype comparison identified novel features previously hidden in collapsed and fragmented cassava genomes, including thousands of allelic variants, inter-haplotype diversity in coding regions, and patterns of diversification through allele-specific expression. Reconstruction of the CMD2 locus revealed a highly complex region with nearly identical gene sets but limited microsynteny between the two cultivars. CONCLUSIONS: The genome maps of the CMD2 locus in both 60444 and TME3, together with the newly annotated genes, will help the identification of the causal genetic basis of CMD2 resistance to geminiviruses. Our de novo cassava genome assemblies will also facilitate genetic mapping approaches to narrow the large CMD2 region to a few candidate genes for better informed strategies to develop robust geminivirus resistance in susceptible cassava cultivars.

Assuntos

Resistência à Doença/genética , Haplótipos/genética , Manihot/genética , Doenças das Plantas/genética , Mapeamento Cromossômico/métodos , Suscetibilidade a Doenças , Geminiviridae , Predisposição Genética para Doença , Anotação de Sequência Molecular

5.

Cell Cycle Constraints and Environmental Control of Local DNA Hypomethylation in α-Proteobacteria.

Ardissone, Silvia; Redder, Peter; Russo, Giancarlo; Frandi, Antonio; Fumeaux, Coralie; Patrignani, Andrea; Schlapbach, Ralph; Falquet, Laurent; Viollier, Patrick H.

PLoS Genet ; 12(12): e1006499, 2016 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-27997543

RESUMO

Heritable DNA methylation imprints are ubiquitous and underlie genetic variability from bacteria to humans. In microbial genomes, DNA methylation has been implicated in gene transcription, DNA replication and repair, nucleoid segregation, transposition and virulence of pathogenic strains. Despite the importance of local (hypo)methylation at specific loci, how and when these patterns are established during the cell cycle remains poorly characterized. Taking advantage of the small genomes and the synchronizability of α-proteobacteria, we discovered that conserved determinants of the cell cycle transcriptional circuitry establish specific hypomethylation patterns in the cell cycle model system Caulobacter crescentus. We used genome-wide methyl-N6-adenine (m6A-) analyses by restriction-enzyme-cleavage sequencing (REC-Seq) and single-molecule real-time (SMRT) sequencing to show that MucR, a transcriptional regulator that represses virulence and cell cycle genes in S-phase but no longer in G1-phase, occludes 5'-GANTC-3' sequence motifs that are methylated by the DNA adenine methyltransferase CcrM. Constitutive expression of CcrM or heterologous methylases in at least two different α-proteobacteria homogenizes m6A patterns even when MucR is present and affects promoter activity. Environmental stress (phosphate limitation) can override and reconfigure local hypomethylation patterns imposed by the cell cycle circuitry that dictate when and where local hypomethylation is instated.

Assuntos

Caulobacter crescentus/genética , Ciclo Celular/genética , Metilação de DNA/genética , Transcrição Gênica , Divisão Celular/genética , Replicação do DNA/efeitos dos fármacos , Replicação do DNA/genética , Regulação Bacteriana da Expressão Gênica , Genoma Microbiano , Metiltransferases/genética , Fosfatos/metabolismo , Regiões Promotoras Genéticas , DNA Metiltransferases Sítio Específica (Adenina-Específica)/genética , Inanição/genética , Inanição/metabolismo

6.

New insights into the performance of human whole-exome capture platforms.

Meienberg, Janine; Zerjavic, Katja; Keller, Irene; Okoniewski, Michal; Patrignani, Andrea; Ludin, Katja; Xu, Zhenyu; Steinmann, Beat; Carrel, Thierry; Röthlisberger, Benno; Schlapbach, Ralph; Bruggmann, Rémy; Matyas, Gabor.

Nucleic Acids Res ; 43(11): e76, 2015 Jun 23.

Artigo em Inglês | MEDLINE | ID: mdl-25820422

RESUMO

Whole exome sequencing (WES) is increasingly used in research and diagnostics. WES users expect coverage of the entire coding region of known genes as well as sufficient read depth for the covered regions. It is, however, unknown which recent WES platform is most suitable to meet these expectations. We present insights into the performance of the most recent standard exome enrichment platforms from Agilent, NimbleGen and Illumina applied to six different DNA samples by two sequencing vendors per platform. Our results suggest that both Agilent and NimbleGen overall perform better than Illumina and that the high enrichment performance of Agilent is stable among samples and between vendors, whereas NimbleGen is only able to achieve vendor- and sample-specific best exome coverage. Moreover, the recent Agilent platform overall captures more coding exons with sufficient read depth than NimbleGen and Illumina. Due to considerable gaps in effective exome coverage, however, the three platforms cannot capture all known coding exons alone or in combination, requiring improvement. Our data emphasize the importance of evaluation of updated platform versions and suggest that enrichment-free whole genome sequencing can overcome the limitations of WES in sufficiently covering coding exons, especially GC-rich regions, and in characterizing structural variants.

Assuntos

Exoma , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Alelos , Composição de Bases , DNA/química , Humanos

7.

High-resolution community profiling of arbuscular mycorrhizal fungi.

Schlaeppi, Klaus; Bender, S Franz; Mascher, Fabio; Russo, Giancarlo; Patrignani, Andrea; Camenzind, Tessa; Hempel, Stefan; Rillig, Matthias C; van der Heijden, Marcel G A.

New Phytol ; 212(3): 780-791, 2016 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-27381250

RESUMO

Community analyses of arbuscular mycorrhizal fungi (AMF) using ribosomal small subunit (SSU) or internal transcribed spacer (ITS) DNA sequences often suffer from low resolution or coverage. We developed a novel sequencing based approach for a highly resolving and specific profiling of AMF communities. We took advantage of previously established AMF-specific PCR primers that amplify a c. 1.5-kb long fragment covering parts of SSU, ITS and parts of the large ribosomal subunit (LSU), and we sequenced the resulting amplicons with single molecule real-time (SMRT) sequencing. The method was applicable to soil and root samples, detected all major AMF families and successfully discriminated closely related AMF species, which would not be discernible using SSU sequences. In inoculation tests we could trace the introduced AMF inoculum at the molecular level. One of the introduced strains almost replaced the local strain(s), revealing that AMF inoculation can have a profound impact on the native community. The methodology presented offers researchers a powerful new tool for AMF community analysis because it unifies improved specificity and enhanced resolution, whereas the drawback of medium sequencing throughput appears of lesser importance for low-diversity groups such as AMF.

Assuntos

Glomeromycota/fisiologia , Micorrizas/fisiologia , DNA Fúngico/genética , Óperon/genética , RNA Ribossômico/genética , Análise de Sequência de DNA , Microbiologia do Solo

8.

Full-length haplotype reconstruction to infer the structure of heterogeneous virus populations.

Giallonardo, Francesca Di; Töpfer, Armin; Rey, Melanie; Prabhakaran, Sandhya; Duport, Yannick; Leemann, Christine; Schmutz, Stefan; Campbell, Nottania K; Joos, Beda; Lecca, Maria Rita; Patrignani, Andrea; Däumer, Martin; Beisel, Christian; Rusert, Peter; Trkola, Alexandra; Günthard, Huldrych F; Roth, Volker; Beerenwinkel, Niko; Metzner, Karin J.

Nucleic Acids Res ; 42(14): e115, 2014 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-24972832

RESUMO

Next-generation sequencing (NGS) technologies enable new insights into the diversity of virus populations within their hosts. Diversity estimation is currently restricted to single-nucleotide variants or to local fragments of no more than a few hundred nucleotides defined by the length of sequence reads. To study complex heterogeneous virus populations comprehensively, novel methods are required that allow for complete reconstruction of the individual viral haplotypes. Here, we show that assembly of whole viral genomes of â¼8600 nucleotides length is feasible from mixtures of heterogeneous HIV-1 strains derived from defined combinations of cloned virus strains and from clinical samples of an HIV-1 superinfected individual. Haplotype reconstruction was achieved using optimized experimental protocols and computational methods for amplification, sequencing and assembly. We comparatively assessed the performance of the three NGS platforms 454 Life Sciences/Roche, Illumina and Pacific Biosciences for this task. Our results prove and delineate the feasibility of NGS-based full-length viral haplotype reconstruction and provide new tools for studying evolution and pathogenesis of viruses.

Assuntos

Variação Genética , HIV-1/genética , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Genoma Viral , Infecções por HIV/virologia , Humanos

9.

Generation of SNP datasets for orangutan population genomics using improved reduced-representation sequencing and direct comparisons of SNP calling algorithms.

Greminger, Maja P; Stölting, Kai N; Nater, Alexander; Goossens, Benoit; Arora, Natasha; Bruggmann, Rémy; Patrignani, Andrea; Nussberger, Beatrice; Sharma, Reeta; Kraus, Robert H S; Ambu, Laurentius N; Singleton, Ian; Chikhi, Lounes; van Schaik, Carel P; Krützen, Michael.

BMC Genomics ; 15: 16, 2014 Jan 10.

Artigo em Inglês | MEDLINE | ID: mdl-24405840

RESUMO

BACKGROUND: High-throughput sequencing has opened up exciting possibilities in population and conservation genetics by enabling the assessment of genetic variation at genome-wide scales. One approach to reduce genome complexity, i.e. investigating only parts of the genome, is reduced-representation library (RRL) sequencing. Like similar approaches, RRL sequencing reduces ascertainment bias due to simultaneous discovery and genotyping of single-nucleotide polymorphisms (SNPs) and does not require reference genomes. Yet, generating such datasets remains challenging due to laboratory and bioinformatical issues. In the laboratory, current protocols require improvements with regards to sequencing homologous fragments to reduce the number of missing genotypes. From the bioinformatical perspective, the reliance of most studies on a single SNP caller disregards the possibility that different algorithms may produce disparate SNP datasets. RESULTS: We present an improved RRL (iRRL) protocol that maximizes the generation of homologous DNA sequences, thus achieving improved genotyping-by-sequencing efficiency. Our modifications facilitate generation of single-sample libraries, enabling individual genotype assignments instead of pooled-sample analysis. We sequenced ~1% of the orangutan genome with 41-fold median coverage in 31 wild-born individuals from two populations. SNPs and genotypes were called using three different algorithms. We obtained substantially different SNP datasets depending on the SNP caller. Genotype validations revealed that the Unified Genotyper of the Genome Analysis Toolkit and SAMtools performed significantly better than a caller from CLC Genomics Workbench (CLC). Of all conflicting genotype calls, CLC was only correct in 17% of the cases. Furthermore, conflicting genotypes between two algorithms showed a systematic bias in that one caller almost exclusively assigned heterozygotes, while the other one almost exclusively assigned homozygotes. CONCLUSIONS: Our enhanced iRRL approach greatly facilitates genotyping-by-sequencing and thus direct estimates of allele frequencies. Our direct comparison of three commonly used SNP callers emphasizes the need to question the accuracy of SNP and genotype calling, as we obtained considerably different SNP datasets depending on caller algorithms, sequencing depths and filtering criteria. These differences affected scans for signatures of natural selection, but will also exert undue influences on demographic inferences. This study presents the first effort to generate a population genomic dataset for wild-born orangutans with known population provenance.

Assuntos

Algoritmos , Bases de Dados Genéticas , Genoma , Genômica/métodos , Polimorfismo de Nucleotídeo Único , Pongo abelii/genética , Animais , Biologia Computacional , Frequência do Gene , Genótipo , Heterozigoto , Sequenciamento de Nucleotídeos em Larga Escala , Software

10.

Upregulation of axon guidance molecules in the adult central nervous system of Nogo-A knockout mice restricts neuronal growth and regeneration.

Kempf, Anissa; Montani, Laura; Petrinovic, Marija M; Schroeter, Aileen; Weinmann, Oliver; Patrignani, Andrea; Schwab, Martin E.

Eur J Neurosci ; 38(11): 3567-79, 2013 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-24103058

RESUMO

Adult central nervous system axons show restricted growth and regeneration properties after injury. One of the underlying mechanisms is the activation of the Nogo-A/Nogo receptor (NgR1) signaling pathway. Nogo-A knockout (KO) mice show enhanced regenerative growth in vivo, even though it is less pronounced than after acute antibody-mediated neutralization of Nogo-A. Residual inhibition may involve a compensatory component. By mRNA expression profiling and immunoblots we show increased expression of several members of the Ephrin/Eph and Semaphorin/Plexin families of axon guidance molecules, e.g. EphrinA3 and EphA4, in the intact spinal cord of adult Nogo-A KO vs. wild-type (WT) mice. EphrinA3 inhibits neurite outgrowth of EphA4-positive neurons in vitro. In addition, EphrinA3 KO myelin extracts are less growth-inhibitory than WT but more than Nogo-A KO myelin extracts. EphA4 KO cortical neurons show decreased growth inhibition on Nogo-A KO myelin as compared with WT neurons, supporting increased EphA4-mediated growth inhibition in Nogo-A KO mice. Consistently, in vivo, Nogo-A/EphA4 double KO mice show increased axonal sprouting and regeneration after spinal cord injury as compared with EphA4 KO mice. Our results reveal the upregulation of developmental axon guidance cues following constitutive Nogo-A deletion, e.g. the EphrinA3/EphA4 ligand/receptor pair, and support their role in restricting neurite outgrowth in the absence of Nogo-A.

Assuntos

Axônios/fisiologia , Córtex Cerebral/metabolismo , Gânglios Espinais/metabolismo , Proteínas da Mielina/metabolismo , Regeneração da Medula Espinal , Regulação para Cima , Animais , Axônios/metabolismo , Células Cultivadas , Córtex Cerebral/patologia , Córtex Cerebral/fisiologia , Efrina-A3/genética , Efrina-A3/metabolismo , Efrina-A4/genética , Efrina-A4/metabolismo , Gânglios Espinais/patologia , Gânglios Espinais/fisiologia , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Knockout , Proteínas da Mielina/genética , Bainha de Mielina/genética , Bainha de Mielina/metabolismo , Proteínas Nogo , Tratos Piramidais/metabolismo , Tratos Piramidais/patologia , Tratos Piramidais/fisiologia , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Traumatismos da Medula Espinal/metabolismo

11.

The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features.

Qi, Weihong; Lim, Yi-Wen; Patrignani, Andrea; Schläpfer, Pascal; Bratus-Neuenschwander, Anna; Grüter, Simon; Chanez, Christelle; Rodde, Nathalie; Prat, Elisa; Vautrin, Sonia; Fustier, Margaux-Alison; Pratas, Diogo; Schlapbach, Ralph; Gruissem, Wilhelm.

Gigascience ; 112022 03 24.

Artigo em Inglês | MEDLINE | ID: mdl-35333302

RESUMO

BACKGROUND: Cassava (Manihot esculenta) is an important clonally propagated food crop in tropical and subtropical regions worldwide. Genetic gain by molecular breeding has been limited, partially because cassava is a highly heterozygous crop with a repetitive and difficult-to-assemble genome. FINDINGS: Here we demonstrate that Pacific Biosciences high-fidelity (HiFi) sequencing reads, in combination with the assembler hifiasm, produced genome assemblies at near complete haplotype resolution with higher continuity and accuracy compared to conventional long sequencing reads. We present 2 chromosome-scale haploid genomes phased with Hi-C technology for the diploid African cassava variety TME204. With consensus accuracy >QV46, contig N50 >18 Mb, BUSCO completeness of 99%, and 35k phased gene loci, it is the most accurate, continuous, complete, and haplotype-resolved cassava genome assembly so far. Ab initio gene prediction with RNA-seq data and Iso-Seq transcripts identified abundant novel gene loci, with enriched functionality related to chromatin organization, meristem development, and cell responses. During tissue development, differentially expressed transcripts of different haplotype origins were enriched for different functionality. In each tissue, 20-30% of transcripts showed allele-specific expression (ASE) differences. ASE bias was often tissue specific and inconsistent across different tissues. Direction-shifting was observed in <2% of the ASE transcripts. Despite high gene synteny, the HiFi genome assembly revealed extensive chromosome rearrangements and abundant intra-genomic and inter-genomic divergent sequences, with large structural variations mostly related to LTR retrotransposons. We use the reference-quality assemblies to build a cassava pan-genome and demonstrate its importance in representing the genetic diversity of cassava for downstream reference-guided omics analysis and breeding. CONCLUSIONS: The phased and annotated chromosome pairs allow a systematic view of the heterozygous diploid genome organization in cassava with improved accuracy, completeness, and haplotype resolution. They will be a valuable resource for cassava breeding and research. Our study may also provide insights into developing cost-effective and efficient strategies for resolving complex genomes with high resolution, accuracy, and continuity.

Assuntos

Manihot , Alelos , Cromossomos , Diploide , Haplótipos , Manihot/genética , Melhoramento Vegetal , Análise de Sequência de DNA , Transcriptoma

12.

AGRONOMICS1: a new resource for Arabidopsis transcriptome profiling.

Rehrauer, Hubert; Aquino, Catharine; Gruissem, Wilhelm; Henz, Stefan R; Hilson, Pierre; Laubinger, Sascha; Naouar, Naira; Patrignani, Andrea; Rombauts, Stephane; Shu, Huan; Van de Peer, Yves; Vuylsteke, Marnik; Weigel, Detlef; Zeller, Georg; Hennig, Lars.

Plant Physiol ; 152(2): 487-99, 2010 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-20032078

RESUMO

Transcriptome profiling has become a routine tool in biology. For Arabidopsis (Arabidopsis thaliana), the Affymetrix ATH1 expression array is most commonly used, but it lacks about one-third of all annotated genes present in the reference strain. An alternative are tiling arrays, but previous designs have not allowed the simultaneous analysis of both strands on a single array. We introduce AGRONOMICS1, a new Affymetrix Arabidopsis microarray that contains the complete paths of both genome strands, with on average one 25mer probe per 35-bp genome sequence window. In addition, the new AGRONOMICS1 array contains all perfect match probes from the original ATH1 array, allowing for seamless integration of the very large existing ATH1 knowledge base. The AGRONOMICS1 array can be used for diverse functional genomics applications such as reliable expression profiling of more than 30,000 genes, detection of alternative splicing, and chromatin immunoprecipitation coupled to microarrays (ChIP-chip). Here, we describe the design of the array and compare its performance with that of the ATH1 array. We find results from both microarrays to be of similar quality, but AGRONOMICS1 arrays yield robust expression information for many more genes, as expected. Analysis of the ATH1 probes on AGRONOMICS1 arrays produces results that closely mirror those of ATH1 arrays. Finally, the AGRONOMICS1 array is shown to be useful for ChIP-chip experiments. We show that heterochromatic H3K9me2 is strongly confined to the gene body of target genes in euchromatic chromosome regions, suggesting that spreading of heterochromatin is limited outside of pericentromeric regions.

Assuntos

Arabidopsis/genética , Perfilação da Expressão Gênica/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Imunoprecipitação da Cromatina , Biologia Computacional , Sondas de DNA , Genes de Plantas , Genômica , RNA de Plantas/genética , Análise de Sequência de DNA

13.

An improved draft genome assembly of Meloidogyne graminicola IARI strain using long-read sequencing.

Somvanshi, Vishal Singh; Dash, Manoranjan; Bhat, Chaitra G; Budhwar, Roli; Godwin, Jeffrey; Shukla, Rohit N; Patrignani, Andrea; Schlapbach, Ralph; Rao, Uma.

Gene ; 793: 145748, 2021 Aug 15.

Artigo em Inglês | MEDLINE | ID: mdl-34077775

RESUMO

The rice root-knot nematode Meloidogyne graminicola is a major biotic stress for the rice crop under upland, rain-fed lowland and irrigated cultivation conditions. Here, we present an improved draft genome assembly of M. graminicola IARI strain using the long-read sequencing approach (PacBio Sequel platform). The assembled genome size was 36.86 Mb with 514 contigs and N50 value of 105 kb. BUSCO estimated the genome to be 88.6% complete. Meloidogyne graminicola genome contained 17.83% repeat elements and showed 14,062 protein-coding gene models, 4,974 conserved orthologous genes, 561 putative secreted proteins, 49 RNAi pathway genes, 1,853 proteins involved in pathogen-host interactions, 1,575 carbohydrate-active enzymes, and 32,138 microsatellites. Five of the carbohydrate-active enzymes were found only in M. graminicola genome and were not present in any other analysed root-knot nematode genome. Together with the previous two genome assemblies, this improved genome assembly would facilitate comparative and functional genomics for M. graminicola.

Assuntos

Genes de Helmintos , Genoma Helmíntico , Proteínas de Helminto/genética , Oryza/parasitologia , Tylenchoidea/genética , Animais , Ontologia Genética , Tamanho do Genoma , Proteínas de Helminto/classificação , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Repetições de Microssatélites , Anotação de Sequência Molecular , Fases de Leitura Aberta , Filogenia , Doenças das Plantas/parasitologia , Tylenchoidea/classificação

14.

Quantification of the spread of SARS-CoV-2 variant B.1.1.7 in Switzerland.

Chen, Chaoran; Nadeau, Sarah Ann; Topolsky, Ivan; Manceau, Marc; Huisman, Jana S; Jablonski, Kim Philipp; Fuhrmann, Lara; Dreifuss, David; Jahn, Katharina; Beckmann, Christiane; Redondo, Maurice; Noppen, Christoph; Risch, Lorenz; Risch, Martin; Wohlwend, Nadia; Kas, Sinem; Bodmer, Thomas; Roloff, Tim; Stange, Madlen; Egli, Adrian; Eckerle, Isabella; Kaiser, Laurent; Denes, Rebecca; Feldkamp, Mirjam; Nissen, Ina; Santacroce, Natascha; Burcklen, Elodie; Aquino, Catharine; de Gouvea, Andreia Cabral; Moccia, Maria Domenica; Grüter, Simon; Sykes, Timothy; Opitz, Lennart; White, Griffin; Neff, Laura; Popovic, Doris; Patrignani, Andrea; Tracy, Jay; Schlapbach, Ralph; Dermitzakis, Emmanouil T; Harshman, Keith; Xenarios, Ioannis; Pegeot, Henri; Cerutti, Lorenzo; Penet, Deborah; Blin, Anthony; Elies, Melyssa; Althaus, Christian L; Beisel, Christian; Beerenwinkel, Niko.

Epidemics ; 37: 100480, 2021 12.

Artigo em Inglês | MEDLINE | ID: mdl-34488035

RESUMO

BACKGROUND: In December 2020, the United Kingdom (UK) reported a SARS-CoV-2 Variant of Concern (VoC) which is now named B.1.1.7. Based on initial data from the UK and later data from other countries, this variant was estimated to have a transmission fitness advantage of around 40-80 % (Volz et al., 2021; Leung et al., 2021; Davies et al., 2021). AIM: This study aims to estimate the transmission fitness advantage and the effective reproductive number of B.1.1.7 through time based on data from Switzerland. METHODS: We generated whole genome sequences from 11.8 % of all confirmed SARS-CoV-2 cases in Switzerland between 14 December 2020 and 11 March 2021. Based on these data, we determine the daily frequency of the B.1.1.7 variant and quantify the variant's transmission fitness advantage on a national and a regional scale. RESULTS: We estimate B.1.1.7 had a transmission fitness advantage of 43-52 % compared to the other variants circulating in Switzerland during the study period. Further, we estimate B.1.1.7 had a reproductive number above 1 from 01 January 2021 until the end of the study period, compared to below 1 for the other variants. Specifically, we estimate the reproductive number for B.1.1.7 was 1.24 [1.07-1.41] from 01 January until 17 January 2021 and 1.18 [1.06-1.30] from 18 January until 01 March 2021 based on the whole genome sequencing data. From 10 March to 16 March 2021, once B.1.1.7 was dominant, we estimate the reproductive number was 1.14 [1.00-1.26] based on all confirmed cases. For reference, Switzerland applied more non-pharmaceutical interventions to combat SARS-CoV-2 on 18 January 2021 and lifted some measures again on 01 March 2021. CONCLUSION: The observed increase in B.1.1.7 frequency in Switzerland during the study period is as expected based on observations in the UK. In absolute numbers, B.1.1.7 increased exponentially with an estimated doubling time of around 2-3.5 weeks. To monitor the ongoing spread of B.1.1.7, our plots are available online.

Assuntos

COVID-19 , SARS-CoV-2 , Humanos , Suíça/epidemiologia , Reino Unido

15.

A comprehensive examination of Nanopore native RNA sequencing for characterization of complex transcriptomes.

Soneson, Charlotte; Yao, Yao; Bratus-Neuenschwander, Anna; Patrignani, Andrea; Robinson, Mark D; Hussain, Shobbir.

Nat Commun ; 10(1): 3359, 2019 07 31.

Artigo em Inglês | MEDLINE | ID: mdl-31366910

RESUMO

A platform for highly parallel direct sequencing of native RNA strands was recently described by Oxford Nanopore Technologies, but despite initial efforts it remains crucial to further investigate the technology for quantification of complex transcriptomes. Here we undertake native RNA sequencing of polyA + RNA from two human cell lines, analysing ~5.2 million aligned native RNA reads. To enable informative comparisons, we also perform relevant ONT direct cDNA- and Illumina-sequencing. We find that while native RNA sequencing does enable some of the anticipated advantages, key unexpected aspects currently hamper its performance, most notably the quite frequent inability to obtain full-length transcripts from single reads, as well as difficulties to unambiguously infer their true transcript of origin. While characterising issues that need to be addressed when investigating more complex transcriptomes, our study highlights that with some defined improvements, native RNA sequencing could be an important addition to the mammalian transcriptomics toolbox.

Assuntos

Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de RNA/métodos , Transcriptoma/genética , Sequência de Bases/genética , Linhagem Celular , DNA Complementar/genética , Células HEK293 , Humanos , Poli A/genética

16.

Ectopic Methylation of a Single Persistently Unmethylated CpG in the Promoter of the Vitellogenin Gene Abolishes Its Inducibility by Estrogen through Attenuation of Upstream Stimulating Factor Binding.

Kallenberger, Lia; Erb, Rachel; Kralickova, Lucie; Patrignani, Andrea; Stöckli, Esther; Jiricny, Josef.

Mol Cell Biol ; 39(23)2019 12 01.

Artigo em Inglês | MEDLINE | ID: mdl-31548262

RESUMO

The enhancer/promoter of the vitellogenin II gene (VTG) has been extensively studied as a model system of vertebrate transcriptional control. While deletion mutagenesis and in vivo footprinting identified the transcription factor (TF) binding sites governing its tissue specificity, DNase hypersensitivity and DNA methylation studies revealed the epigenetic changes accompanying its hormone-dependent activation. Moreover, upon induction with estrogen (E2), the region flanking the estrogen-responsive element (ERE) was reported to undergo active DNA demethylation. We now show that although the VTG ERE is methylated in embryonic chicken liver and in LMH/2A hepatocytes, its induction by E2 was not accompanied by extensive demethylation. In contrast, E2 failed to activate a VTG enhancer/promoter-controlled luciferase reporter gene methylated by SssI. Surprisingly, this inducibility difference could be traced not to the ERE but rather to a single CpG in an E-box (CACGTG) sequence upstream of the VTG TATA box, which is unmethylated in vivo but methylated by SssI. We demonstrate that this E-box binds the upstream stimulating factor USF1/2. Selective methylation of the CpG within this binding site with an E-box-specific DNA methyltransferase, Eco72IM, was sufficient to attenuate USF1/2 binding in vitro and abolish the hormone-induced transcription of the VTG gene in the reporter system.

Assuntos

Expressão Ectópica do Gene/genética , Receptor alfa de Estrogênio/genética , Vitelogeninas/genética , Animais , Sítios de Ligação , Linhagem Celular , Embrião de Galinha , Ilhas de CpG/genética , Metilação de DNA/genética , Proteínas de Ligação a DNA/metabolismo , DNA-Citosina Metilases/metabolismo , Expressão Ectópica do Gene/efeitos dos fármacos , Receptor alfa de Estrogênio/metabolismo , Estrogênios/metabolismo , Regulação da Expressão Gênica/efeitos dos fármacos , Genes Reporter , Humanos , Regiões Promotoras Genéticas/efeitos dos fármacos , Regiões Promotoras Genéticas/genética , Sequências Reguladoras de Ácido Nucleico , Fatores de Transcrição/metabolismo , Vitelogeninas/metabolismo

17.

New insights on Pseudoalteromonas haloplanktis TAC125 genome organization and benchmarks of genome assembly applications using next and third generation sequencing technologies.

Qi, Weihong; Colarusso, Andrea; Olombrada, Miriam; Parrilli, Ermenegilda; Patrignani, Andrea; Tutino, Maria Luisa; Toll-Riera, Macarena.

Sci Rep ; 9(1): 16444, 2019 11 11.

Artigo em Inglês | MEDLINE | ID: mdl-31712730

RESUMO

Pseudoalteromonas haloplanktis TAC125 is among the most commonly studied bacteria adapted to cold environments. Aside from its ecological relevance, P. haloplanktis has a potential use for biotechnological applications. Due to its importance, we decided to take advantage of next generation sequencing (Illumina) and third generation sequencing (PacBio and Oxford Nanopore) technologies to resequence its genome. The availability of a reference genome, obtained using whole genome shotgun sequencing, allowed us to study and compare the results obtained by the different technologies and draw useful conclusions for future de novo genome assembly projects. We found that assembly polishing using Illumina reads is needed to achieve a consensus accuracy over 99.9% when using Oxford Nanopore sequencing, but not in PacBio sequencing. However, the dependency of consensus accuracy on coverage is lower in Oxford Nanopore than in PacBio, suggesting that a cost-effective solution might be the use of low coverage Oxford Nanopore sequencing together with Illumina reads. Despite the differences in consensus accuracy, all sequencing technologies revealed the presence of a large plasmid, pMEGA, which was undiscovered until now. Among the most interesting features of pMEGA is the presence of a putative error-prone polymerase regulated through the SOS response. Aside from the characterization of the newly discovered plasmid, we confirmed the sequence of the small plasmid pMtBL and uncovered the presence of a potential partitioning system. Crucially, this study shows that the combination of next and third generation sequencing technologies give us an unprecedented opportunity to characterize our bacterial model organisms at a very detailed level.

Assuntos

Genoma Bacteriano , Genômica , Infecções por Bactérias Gram-Negativas/microbiologia , Sequenciamento de Nucleotídeos em Larga Escala , Pseudoalteromonas/genética , Organismos Aquáticos , Biologia Computacional/métodos , Genômica/métodos , Anotação de Sequência Molecular , Microbiologia da Água

18.

Multiple hybrid de novo genome assembly of finger millet, an orphan allotetraploid crop.

Hatakeyama, Masaomi; Aluri, Sirisha; Balachadran, Mathi Thumilan; Sivarajan, Sajeevan Radha; Patrignani, Andrea; Grüter, Simon; Poveda, Lucy; Shimizu-Inatsugi, Rie; Baeten, John; Francoijs, Kees-Jan; Nataraja, Karaba N; Reddy, Yellodu A Nanja; Phadnis, Shamprasad; Ravikumar, Ramapura L; Schlapbach, Ralph; Sreeman, Sheshshayee M; Shimizu, Kentaro K.

DNA Res ; 25(1): 39-47, 2018 Feb 01.

Artigo em Inglês | MEDLINE | ID: mdl-28985356

RESUMO

Finger millet (Eleusine coracana (L.) Gaertn) is an important crop for food security because of its tolerance to drought, which is expected to be exacerbated by global climate changes. Nevertheless, it is often classified as an orphan/underutilized crop because of the paucity of scientific attention. Among several small millets, finger millet is considered as an excellent source of essential nutrient elements, such as iron and zinc; hence, it has potential as an alternate coarse cereal. However, high-quality genome sequence data of finger millet are currently not available. One of the major problems encountered in the genome assembly of this species was its polyploidy, which hampers genome assembly compared with a diploid genome. To overcome this problem, we sequenced its genome using diverse technologies with sufficient coverage and assembled it via a novel multiple hybrid assembly workflow that combines next-generation with single-molecule sequencing, followed by whole-genome optical mapping using the Bionano Irys® system. The total number of scaffolds was 1,897 with an N50 length >2.6 Mb and detection of 96% of the universal single-copy orthologs. The majority of the homeologs were assembled separately. This indicates that the proposed workflow is applicable to the assembly of other allotetraploid genomes.

19.

Author Correction: Genome expansion and lineage-specific genetic innovations in the forest pathogenic fungi Armillaria.

Sipos, György; Prasanna, Arun N; Walter, Mathias C; O'Connor, Eoin; Bálint, Balázs; Krizsán, Krisztina; Kiss, Brigitta; Hess, Jaqueline; Varga, Torda; Slot, Jason; Riley, Robert; Bóka, Bettina; Rigling, Daniel; Barry, Kerrie; Lee, Juna; Mihaltcheva, Sirma; LaButti, Kurt; Lipzen, Anna; Waldron, Rose; Moloney, Nicola M; Sperisen, Christoph; Kredics, László; Vágvölgyi, Csaba; Patrignani, Andrea; Fitzpatrick, David; Nagy, István; Doyle, Sean; Anderson, James B; Grigoriev, Igor V; Güldener, Ulrich; Münsterkötter, Martin; Nagy, László G.

Nat Ecol Evol ; 2(3): 577, 2018 03.

Artigo em Inglês | MEDLINE | ID: mdl-29426895

RESUMO

In the version of this Article originally published, it was incorrectly stated that "16,687 protein-coding genes were inferred for the most recent common ancestor (MRCA) of Armillaria"; the value was incorrect and it should have read "15,787". This has now been corrected.

20.

Large genomic fibrillin-1 (FBN1) gene deletions provide evidence for true haploinsufficiency in Marfan syndrome.

Mátyás, Gábor; Alonso, Sira; Patrignani, Andrea; Marti, Myriam; Arnold, Eliane; Magyar, István; Henggeler, Caroline; Carrel, Thierry; Steinmann, Beat; Berger, Wolfgang.

Hum Genet ; 122(1): 23-32, 2007 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-17492313

RESUMO

Mutations in the FBN1 gene are the major cause of Marfan syndrome (MFS), an autosomal dominant connective tissue disorder, which displays variable manifestations in the cardiovascular, ocular, and skeletal systems. Current molecular genetic testing of FBN1 may miss mutations in the promoter region or in other noncoding sequences as well as partial or complete gene deletions and duplications. In this study, we tested for copy number variations by successively applying multiplex ligation-dependent probe amplification (MLPA) and the Affymetrix Human Mapping 500 K Array Set, which contains probes for approximately 500,000 single-nucleotide polymorphisms (SNPs) across the genome. By analyzing genomic DNA of 101 unrelated individuals with MFS or related phenotypes in whom standard genetic testing detected no mutation, we identified FBN1 deletions in two patients with MFS. Our high-resolution approach narrowed down the deletion breakpoints. Subsequent sequencing of the junctional fragments revealed the deletion sizes of 26,887 and 302,580 bp, respectively. Surprisingly, both deletions affect the putative regulatory and promoter region of the FBN1 gene, strongly indicating that they abolish transcription of the deleted allele. This expectation of complete loss of function of one allele, i.e. true haploinsufficiency, was confirmed by transcript analyses. Our findings not only emphasize the importance of screening for large genomic rearrangements in comprehensive genetic testing of FBN1 but, importantly, also extend the molecular etiology of MFS by providing hitherto unreported evidence that true haploinsufficiency is sufficient to cause MFS.

Assuntos

Deleção de Genes , Perda de Heterozigosidade , Síndrome de Marfan/genética , Proteínas dos Microfilamentos/genética , Sequência de Bases , Quebra Cromossômica , Estudos de Coortes , Análise Mutacional de DNA/métodos , Fibrilina-1 , Fibrilinas , Testes Genéticos , Haplótipos , Humanos , Dados de Sequência Molecular , Polimorfismo de Nucleotídeo Único , RNA Mensageiro/análise

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA