Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 25
Filter
Add more filters










Publication year range
1.
Nature ; 604(7905): 310-315, 2022 04.
Article in English | MEDLINE | ID: mdl-35388217

ABSTRACT

Comprehensive genome annotation is essential to understand the impact of clinically relevant variants. However, the absence of a standard for clinical reporting and browser display complicates the process of consistent interpretation and reporting. To address these challenges, Ensembl/GENCODE1 and RefSeq2 launched a joint initiative, the Matched Annotation from NCBI and EMBL-EBI (MANE) collaboration, to converge on human gene and transcript annotation and to jointly define a high-value set of transcripts and corresponding proteins. Here, we describe the MANE transcript sets for use as universal standards for variant reporting and browser display. The MANE Select set identifies a representative transcript for each human protein-coding gene, whereas the MANE Plus Clinical set provides additional transcripts at loci where the Select transcripts alone are not sufficient to report all currently known clinical variants. Each MANE transcript represents an exact match between the exonic sequences of an Ensembl/GENCODE transcript and its counterpart in RefSeq such that the identifiers can be used synonymously. We have now released MANE Select transcripts for 97% of human protein-coding genes, including all American College of Medical Genetics and Genomics Secondary Findings list v3.0 (ref. 3) genes. MANE transcripts are accessible from major genome browsers and key resources. Widespread adoption of these transcript sets will increase the consistency of reporting, facilitate the exchange of data regardless of the annotation source and help to streamline clinical interpretation.


Subject(s)
Computational Biology , Databases, Genetic , Genomics , Genome , Humans , Information Dissemination , Molecular Sequence Annotation , National Library of Medicine (U.S.) , United States
2.
Nature ; 563(7732): 501-507, 2018 11.
Article in English | MEDLINE | ID: mdl-30429615

ABSTRACT

Female Aedes aegypti mosquitoes infect more than 400 million people each year with dangerous viral pathogens including dengue, yellow fever, Zika and chikungunya. Progress in understanding the biology of mosquitoes and developing the tools to fight them has been slowed by the lack of a high-quality genome assembly. Here we combine diverse technologies to produce the markedly improved, fully re-annotated AaegL5 genome assembly, and demonstrate how it accelerates mosquito science. We anchored physical and cytogenetic maps, doubled the number of known chemosensory ionotropic receptors that guide mosquitoes to human hosts and egg-laying sites, provided further insight into the size and composition of the sex-determining M locus, and revealed copy-number variation among glutathione S-transferase genes that are important for insecticide resistance. Using high-resolution quantitative trait locus and population genomic analyses, we mapped new candidates for dengue vector competence and insecticide resistance. AaegL5 will catalyse new biological insights and intervention strategies to fight this deadly disease vector.


Subject(s)
Aedes/genetics , Arbovirus Infections/virology , Arboviruses , Genome, Insect/genetics , Genomics/standards , Insect Control , Mosquito Vectors/genetics , Mosquito Vectors/virology , Aedes/virology , Animals , Arbovirus Infections/transmission , Arboviruses/isolation & purification , DNA Copy Number Variations/genetics , Dengue Virus/isolation & purification , Female , Genetic Variation/genetics , Genetics, Population , Glutathione Transferase/genetics , Insecticide Resistance/drug effects , Male , Molecular Sequence Annotation , Multigene Family/genetics , Pyrethrins/pharmacology , Reference Standards , Sex Determination Processes/genetics
3.
PLoS One ; 13(7): e0199169, 2018.
Article in English | MEDLINE | ID: mdl-29966003

ABSTRACT

Aspergillus flavus is a saprophytic fungus that infects corn, peanuts, tree nuts and other agriculturally important crops. Once the crop is infected the fungus has the potential to secrete one or more mycotoxins, the most carcinogenic of which is aflatoxin. Aflatoxin contaminated crops are deemed unfit for human or animal consumption, which results in both food and economic losses. Within A. flavus, two morphotypes exist: the S strains (small sclerotia) and L strains (large sclerotia). Significant morphological and physiological differences exist between the two morphotypes. For example, the S-morphotypes produces sclerotia that are smaller (< 400 µm), greater in quantity, and contain higher concentrations of aflatoxin than the L-morphotypes (>400 µm). The morphotypes also differ in pigmentation, pH homeostasis in culture and the number of spores produced. Here we report the first full genome sequence of an A. flavus S morphotype, strain AF70. We provide a comprehensive comparison of the A. flavus S-morphotype genome sequence with a previously sequenced genome of an L-morphotype strain (NRRL 3357), including an in-depth analysis of secondary metabolic clusters and the identification SNPs within their aflatoxin gene clusters.


Subject(s)
Aspergillus flavus/genetics , Genome, Fungal/genetics , Plant Diseases/genetics , Spores, Fungal/genetics , Aflatoxins/genetics , Aflatoxins/toxicity , Arachis/microbiology , Aspergillus flavus/classification , Aspergillus flavus/pathogenicity , Crops, Agricultural/genetics , Crops, Agricultural/microbiology , Nuts/microbiology , Plant Diseases/microbiology , Spores, Fungal/pathogenicity , Zea mays/microbiology
4.
Nucleic Acids Res ; 46(D1): D221-D228, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29126148

ABSTRACT

The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community.


Subject(s)
Consensus Sequence , Databases, Genetic , Open Reading Frames , Animals , Data Curation/methods , Data Curation/standards , Databases, Genetic/standards , Guidelines as Topic , Humans , Mice , Molecular Sequence Annotation , National Library of Medicine (U.S.) , United States , User-Computer Interface
5.
Nat Commun ; 7: 10507, 2016 Feb 09.
Article in English | MEDLINE | ID: mdl-26856261

ABSTRACT

Ticks transmit more pathogens to humans and animals than any other arthropod. We describe the 2.1 Gbp nuclear genome of the tick, Ixodes scapularis (Say), which vectors pathogens that cause Lyme disease, human granulocytic anaplasmosis, babesiosis and other diseases. The large genome reflects accumulation of repetitive DNA, new lineages of retro-transposons, and gene architecture patterns resembling ancient metazoans rather than pancrustaceans. Annotation of scaffolds representing ∼57% of the genome, reveals 20,486 protein-coding genes and expansions of gene families associated with tick-host interactions. We report insights from genome analyses into parasitic processes unique to ticks, including host 'questing', prolonged feeding, cuticle synthesis, blood meal concentration, novel methods of haemoglobin digestion, haem detoxification, vitellogenesis and prolonged off-host survival. We identify proteins associated with the agent of human granulocytic anaplasmosis, an emerging disease, and the encephalitis-causing Langat virus, and a population structure correlated to life-history traits and transmission of the Lyme disease agent.


Subject(s)
Anaplasma phagocytophilum , Arachnid Vectors/genetics , Genome/genetics , Ixodes/genetics , Ligand-Gated Ion Channels/genetics , Animals , Gene Expression Profiling , Genomics , Lyme Disease/transmission , Oocytes , Xenopus laevis
6.
Nucleic Acids Res ; 44(D1): D733-45, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26553804

ABSTRACT

The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.


Subject(s)
Databases, Genetic , Genomics , Animals , Cattle , Gene Expression Profiling , Genome, Fungal , Genome, Human , Genome, Microbial , Genome, Plant , Genome, Viral , Genomics/standards , Humans , Invertebrates/genetics , Mice , Molecular Sequence Annotation , Nematoda/genetics , Phylogeny , RNA, Long Noncoding/genetics , Rats , Reference Standards , Sequence Analysis, Protein , Sequence Analysis, RNA , Vertebrates/genetics
7.
Mamm Genome ; 26(9-10): 379-90, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26215545

ABSTRACT

Complete and accurate annotation of the mouse genome is critical to the advancement of research conducted on this important model organism. The National Center for Biotechnology Information (NCBI) develops and maintains many useful resources to assist the mouse research community. In particular, the reference sequence (RefSeq) database provides high-quality annotation of multiple mouse genome assemblies using a combinatorial approach that leverages computation, manual curation, and collaboration. Implementation of this conservative and rigorous approach, which focuses on representation of only full-length and non-redundant data, produces high-quality annotation products. RefSeq records explicitly link sequences to current knowledge in a timely manner, updating public records regularly and rapidly in response to nomenclature updates, addition of new relevant publications, collaborator discussion, and user feedback. Whole genome re-annotation is also conducted at least every 12-18 months, and often more frequently in response to assembly updates or availability of informative data. This article highlights key features and advantages of RefSeq genome annotation products and presents an overview of NCBI processes to generate these data. Further discussion of NCBI's resources highlights useful features and the best methods for accessing our data.


Subject(s)
Amino Acid Sequence/genetics , Databases, Genetic , Databases, Nucleic Acid , Genome , Animals , Internet , Mice
8.
PLoS Pathog ; 11(4): e1004834, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25909486

ABSTRACT

Invasive aspergillosis (IA) due to Aspergillus fumigatus is a major cause of mortality in immunocompromised patients. The discovery of highly fertile strains of A. fumigatus opened the possibility to merge classical and contemporary genetics to address key questions about this pathogen. The merger involves sexual recombination, selection of desired traits, and genomics to identify any associated loci. We constructed a highly fertile isogenic pair of A. fumigatus strains with opposite mating types and used them to investigate whether mating type is associated with virulence and to find the genetic loci involved in azole resistance. The pair was made isogenic by 9 successive backcross cycles of the foundational strain AFB62 (MAT1-1) with a highly fertile (MAT1-2) progeny. Genome sequencing showed that the F9 MAT1-2 progeny was essentially identical to the AFB62. The survival curves of animals infected with either strain in three different animal models showed no significant difference, suggesting that virulence in A. fumigatus was not associated with mating type. We then employed a relatively inexpensive, yet highly powerful strategy to identify genomic loci associated with azole resistance. We used traditional in vitro drug selection accompanied by classical sexual crosses of azole-sensitive with resistant isogenic strains. The offspring were plated under varying drug concentrations and pools of resulting colonies were analyzed by whole genome sequencing. We found that variants in 5 genes contributed to azole resistance, including mutations in erg11A (cyp51A), as well as multi-drug transporters, erg25, and in HMG-CoA reductase. The results demonstrated that with minimal investment into the sequencing of three pools from a cross of interest, the variation(s) that contribute any phenotype can be identified with nucleotide resolution. This approach can be applied to multiple areas of interest in A. fumigatus or other heterothallic pathogens, especially for virulence associated traits.


Subject(s)
Antifungal Agents/pharmacology , Aspergillus fumigatus/drug effects , Azoles/pharmacology , Drug Resistance, Multiple, Fungal , Hydroxymethylglutaryl CoA Reductases/metabolism , Mixed Function Oxygenases/metabolism , Sterol 14-Demethylase/metabolism , Animals , Antifungal Agents/therapeutic use , Aspergillosis/drug therapy , Aspergillosis/microbiology , Aspergillosis/pathology , Aspergillus fumigatus/isolation & purification , Aspergillus fumigatus/metabolism , Aspergillus fumigatus/pathogenicity , Azoles/therapeutic use , Crosses, Genetic , Drug Resistance, Multiple, Fungal/drug effects , Fungal Proteins/genetics , Fungal Proteins/metabolism , Genes, Mating Type, Fungal/drug effects , Genetic Loci/drug effects , Hydroxymethylglutaryl CoA Reductases/genetics , Itraconazole/pharmacology , Itraconazole/therapeutic use , Larva/drug effects , Mice, Inbred BALB C , Microbial Sensitivity Tests , Mixed Function Oxygenases/genetics , Moths/drug effects , Mutation , Sterol 14-Demethylase/genetics , Survival Analysis , Triazoles/pharmacology , Triazoles/therapeutic use , Virulence/drug effects , Voriconazole/pharmacology , Voriconazole/therapeutic use
9.
Genome Announc ; 2(5)2014 Oct 30.
Article in English | MEDLINE | ID: mdl-25359908

ABSTRACT

The soil fungus Rhizoctonia solani is a pathogen of agricultural crops. Here, we report on the 51,705,945 bp draft consensus genome sequence of R. solani strain Rhs1AP. A comprehensive understanding of the heterokaryotic genome complexity and organization of R. solani may provide insight into the plant disease ecology and adaptive behavior of the fungus.

10.
Mycopathologia ; 178(5-6): 331-9, 2014 Dec.
Article in English | MEDLINE | ID: mdl-24996522

ABSTRACT

We utilized RNAseq analysis of the Aspergillus fumigatus response to early hypoxic condition exposure. The results show that more than 89% of the A. fumigatus genome is expressed under normoxic and hypoxic conditions. Replicate samples were highly reproducible; however, comparisons between normoxia and hypoxia revealed that >23 and 35% of genes were differentially expressed after 30 and 120 min of hypoxia exposure, respectively. Consistent with our previous report detailing transcriptomic and proteomic responses at later time points, the results here show major repression of ribosomal function and induction of ergosterol biosynthesis, as well as activation of alternate respiratory mechanisms at the later time point. RNAseq data were used to define 32 hypoxia-specific genes, which were not expressed under normoxic conditions. Transcripts of a C6 transcription factor and a histidine kinase-response regulator were found only in hypoxia. In addition, several genes involved in the phosphoenylpyruvate and D-glyceraldehyde-3-phosphate metabolism were only expressed in hypoxia. Interestingly, a 216-bp ncRNA Afu-182 in the 3' region of insA (AFUB_064770) was significantly repressed under hypoxia with a 40-fold reduction in expression. A detailed analysis of Afu-182 showed similarity with several genes in the genome, many of which were also repressed in hypoxia. The results from this study show that hypoxia induces very early and widely drastic genome-wide responses in A. fumigatus that include expression of protein-coding and ncRNA genes. The role of these ncRNA genes in regulating the fungal hypoxia response is an exciting future research direction.


Subject(s)
Aspergillus fumigatus/physiology , Gene Expression Regulation, Fungal , RNA, Untranslated/metabolism , Stress, Physiological , Anaerobiosis , Aspergillus fumigatus/genetics , Gene Expression Profiling , Sequence Analysis, DNA
11.
FEMS Microbiol Lett ; 352(2): 165-73, 2014 Mar.
Article in English | MEDLINE | ID: mdl-24461055

ABSTRACT

The soil fungus Rhizoctonia solani is an economically important pathogen of agricultural and forestry crops. Here, we present the complete sequence and analysis of the mitochondrial genome of R. solani, field isolate Rhs1AP. The genome (235 849 bp) is the largest mitochondrial genome of a filamentous fungus sequenced to date and exhibits a rich accumulation of introns, novel repeat sequences, homing endonuclease genes, and hypothetical genes. Stable secondary structures exhibited by repeat sequences suggest that they comprise functional, possibly catalytic RNA elements. RNA-Seq expression profiling confirmed that the majority of homing endonuclease genes and hypothetical genes are transcriptionally active. Comparative analysis suggests that the mitochondrial genome of R. solani is an example of a dynamic history of expansion in filamentous fungi.


Subject(s)
Genome, Mitochondrial/genetics , Interspersed Repetitive Sequences , Rhizoctonia/genetics , Sequence Analysis, DNA , DNA, Fungal/chemistry , DNA, Fungal/genetics , DNA, Mitochondrial/chemistry , DNA, Mitochondrial/genetics , Evolution, Molecular , Gene Expression Profiling , Molecular Sequence Data
12.
BMC Genomics ; 13: 698, 2012 Dec 12.
Article in English | MEDLINE | ID: mdl-23234273

ABSTRACT

BACKGROUND: The genera Aspergillus and Penicillium include some of the most beneficial as well as the most harmful fungal species such as the penicillin-producer Penicillium chrysogenum and the human pathogen Aspergillus fumigatus, respectively. Their mitochondrial genomic sequences may hold vital clues into the mechanisms of their evolution, population genetics, and biology, yet only a handful of these genomes have been fully sequenced and annotated. RESULTS: Here we report the complete sequence and annotation of the mitochondrial genomes of six Aspergillus and three Penicillium species: A. fumigatus, A. clavatus, A. oryzae, A. flavus, Neosartorya fischeri (A. fischerianus), A. terreus, P. chrysogenum, P. marneffei, and Talaromyces stipitatus (P. stipitatum). The accompanying comparative analysis of these and related publicly available mitochondrial genomes reveals wide variation in size (25-36 Kb) among these closely related fungi. The sources of genome expansion include group I introns and accessory genes encoding putative homing endonucleases, DNA and RNA polymerases (presumed to be of plasmid origin) and hypothetical proteins. The two smallest sequenced genomes (A. terreus and P. chrysogenum) do not contain introns in protein-coding genes, whereas the largest genome (T. stipitatus), contains a total of eleven introns. All of the sequenced genomes have a group I intron in the large ribosomal subunit RNA gene, suggesting that this intron is fixed in these species. Subsequent analysis of several A. fumigatus strains showed low intraspecies variation. This study also includes a phylogenetic analysis based on 14 concatenated core mitochondrial proteins. The phylogenetic tree has a different topology from published multilocus trees, highlighting the challenges still facing the Aspergillus systematics. CONCLUSIONS: The study expands the genomic resources available to fungal biologists by providing mitochondrial genomes with consistent annotations for future genetic, evolutionary and population studies. Despite the conservation of the core genes, the mitochondrial genomes of Aspergillus and Penicillium species examined here exhibit significant amount of interspecies variation. Most of this variation can be attributed to accessory genes and mobile introns, presumably acquired by horizontal gene transfer of mitochondrial plasmids and intron homing.


Subject(s)
Aspergillus/genetics , Genes, Fungal/genetics , Genome Size/genetics , Genome, Mitochondrial/genetics , Introns/genetics , Penicillium/genetics , Sequence Analysis , Base Sequence , Evolution, Molecular , Genes, Mitochondrial/genetics , Genetic Variation/genetics , Molecular Sequence Annotation , Mutagenesis, Insertional/genetics , Phylogeny , Plasmids/genetics
13.
J Bacteriol ; 194(2): 376-94, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22056929

ABSTRACT

We present the draft genome for the Rickettsia endosymbiont of Ixodes scapularis (REIS), a symbiont of the deer tick vector of Lyme disease in North America. Among Rickettsia species (Alphaproteobacteria: Rickettsiales), REIS has the largest genome sequenced to date (>2 Mb) and contains 2,309 genes across the chromosome and four plasmids (pREIS1 to pREIS4). The most remarkable finding within the REIS genome is the extraordinary proliferation of mobile genetic elements (MGEs), which contributes to a limited synteny with other Rickettsia genomes. In particular, an integrative conjugative element named RAGE (for Rickettsiales amplified genetic element), previously identified in scrub typhus rickettsiae (Orientia tsutsugamushi) genomes, is present on both the REIS chromosome and plasmids. Unlike the pseudogene-laden RAGEs of O. tsutsugamushi, REIS encodes nine conserved RAGEs that include F-like type IV secretion systems similar to that of the tra genes encoded in the Rickettsia bellii and R. massiliae genomes. An unparalleled abundance of encoded transposases (>650) relative to genome size, together with the RAGEs and other MGEs, comprise ~35% of the total genome, making REIS one of the most plastic and repetitive bacterial genomes sequenced to date. We present evidence that conserved rickettsial genes associated with an intracellular lifestyle were acquired via MGEs, especially the RAGE, through a continuum of genomic invasions. Robust phylogeny estimation suggests REIS is ancestral to the virulent spotted fever group of rickettsiae. As REIS is not known to invade vertebrate cells and has no known pathogenic effects on I. scapularis, its genome sequence provides insight on the origin of mechanisms of rickettsial pathogenicity.


Subject(s)
Gene Expression Regulation, Bacterial/physiology , Genome, Bacterial , Interspersed Repetitive Sequences , Ixodes/microbiology , Rickettsia/genetics , Animals , Arachnid Vectors/microbiology , Biological Evolution , Chromosome Mapping , Chromosomes, Bacterial , Molecular Sequence Data , Plasmids , Symbiosis
14.
Genome Biol ; 12(10): R100, 2011 Oct 17.
Article in English | MEDLINE | ID: mdl-22004680

ABSTRACT

BACKGROUND: Ichthyophthirius multifiliis, commonly known as Ich, is a highly pathogenic ciliate responsible for 'white spot', a disease causing significant economic losses to the global aquaculture industry. Options for disease control are extremely limited, and Ich's obligate parasitic lifestyle makes experimental studies challenging. Unlike most well-studied protozoan parasites, Ich belongs to a phylum composed primarily of free-living members. Indeed, it is closely related to the model organism Tetrahymena thermophila. Genomic studies represent a promising strategy to reduce the impact of this disease and to understand the evolutionary transition to parasitism. RESULTS: We report the sequencing, assembly and annotation of the Ich macronuclear genome. Compared with its free-living relative T. thermophila, the Ich genome is reduced approximately two-fold in length and gene density and three-fold in gene content. We analyzed in detail several gene classes with diverse functions in behavior, cellular function and host immunogenicity, including protein kinases, membrane transporters, proteases, surface antigens and cytoskeletal components and regulators. We also mapped by orthology Ich's metabolic pathways in comparison with other ciliates and a potential host organism, the zebrafish Danio rerio. CONCLUSIONS: Knowledge of the complete protein-coding and metabolic potential of Ich opens avenues for rational testing of therapeutic drugs that target functions essential to this parasite but not to its fish hosts. Also, a catalog of surface protein-encoding genes will facilitate development of more effective vaccines. The potential to use T. thermophila as a surrogate model offers promise toward controlling 'white spot' disease and understanding the adaptation to a parasitic lifestyle.


Subject(s)
Ciliophora Infections/prevention & control , Genomics/methods , Hymenostomatida/genetics , Life Cycle Stages , Zebrafish/parasitology , Animals , Antigens, Protozoan/genetics , Base Composition , Chromosome Mapping , DNA, Mitochondrial/genetics , DNA, Protozoan/genetics , Databases, Genetic , Genes, Protozoan , Genome Size , Host-Parasite Interactions , Hymenostomatida/classification , Hymenostomatida/growth & development , Hymenostomatida/pathogenicity , Ictaluridae/parasitology , Macronucleus/genetics , Membrane Transport Proteins/genetics , Metabolic Networks and Pathways , Mitochondria/enzymology , Mitochondria/genetics , Mitochondrial Proton-Translocating ATPases/genetics , Molecular Sequence Annotation , Phylogeny , Protein Kinases/classification , Protein Kinases/genetics , Protozoan Proteins/genetics , RNA, Protozoan/genetics , Zebrafish/genetics
15.
Fungal Genet Biol ; 46 Suppl 1: S2-13, 2009 Mar.
Article in English | MEDLINE | ID: mdl-19146970

ABSTRACT

The identification and annotation of protein-coding genes is one of the primary goals of whole-genome sequencing projects, and the accuracy of predicting the primary protein products of gene expression is vital to the interpretation of the available data and the design of downstream functional applications. Nevertheless, the comprehensive annotation of eukaryotic genomes remains a considerable challenge. Many genomes submitted to public databases, including those of major model organisms, contain significant numbers of wrong and incomplete gene predictions. We present a community-based reannotation of the Aspergillus nidulans genome with the primary goal of increasing the number and quality of protein functional assignments through the careful review of experts in the field of fungal biology.


Subject(s)
Aspergillus nidulans/genetics , Fungal Proteins/genetics , Genes, Fungal , Genome, Fungal , Genomics , Aspergillus nidulans/physiology
16.
Nat Biotechnol ; 26(10): 1161-8, 2008 Oct.
Article in English | MEDLINE | ID: mdl-18820685

ABSTRACT

Industrial penicillin production with the filamentous fungus Penicillium chrysogenum is based on an unprecedented effort in microbial strain improvement. To gain more insight into penicillin synthesis, we sequenced the 32.19 Mb genome of P. chrysogenum Wisconsin54-1255 and identified numerous genes responsible for key steps in penicillin production. DNA microarrays were used to compare the transcriptomes of the sequenced strain and a penicillinG high-producing strain, grown in the presence and absence of the side-chain precursor phenylacetic acid. Transcription of genes involved in biosynthesis of valine, cysteine and alpha-aminoadipic acid-precursors for penicillin biosynthesis-as well as of genes encoding microbody proteins, was increased in the high-producing strain. Some gene products were shown to be directly controlling beta-lactam output. Many key cellular transport processes involving penicillins and intermediates remain to be characterized at the molecular level. Genes predicted to encode transporters were strongly overrepresented among the genes transcriptionally upregulated under conditions that stimulate penicillinG production, illustrating potential for future genomics-driven metabolic engineering.


Subject(s)
Chromosome Mapping/methods , Fungal Proteins/genetics , Genome, Fungal/genetics , Penicillin G/metabolism , Penicillium chrysogenum/genetics , Transcription Factors/genetics , Base Sequence , Molecular Sequence Data , Sequence Analysis, DNA/methods
17.
PLoS Genet ; 4(4): e1000046, 2008 Apr 11.
Article in English | MEDLINE | ID: mdl-18404212

ABSTRACT

We present the genome sequences of a new clinical isolate of the important human pathogen, Aspergillus fumigatus, A1163, and two closely related but rarely pathogenic species, Neosartorya fischeri NRRL181 and Aspergillus clavatus NRRL1. Comparative genomic analysis of A1163 with the recently sequenced A. fumigatus isolate Af293 has identified core, variable and up to 2% unique genes in each genome. While the core genes are 99.8% identical at the nucleotide level, identity for variable genes can be as low 40%. The most divergent loci appear to contain heterokaryon incompatibility (het) genes associated with fungal programmed cell death such as developmental regulator rosA. Cross-species comparison has revealed that 8.5%, 13.5% and 12.6%, respectively, of A. fumigatus, N. fischeri and A. clavatus genes are species-specific. These genes are significantly smaller in size than core genes, contain fewer exons and exhibit a subtelomeric bias. Most of them cluster together in 13 chromosomal islands, which are enriched for pseudogenes, transposons and other repetitive elements. At least 20% of A. fumigatus-specific genes appear to be functional and involved in carbohydrate and chitin catabolism, transport, detoxification, secondary metabolism and other functions that may facilitate the adaptation to heterogeneous environments such as soil or a mammalian host. Contrary to what was suggested previously, their origin cannot be attributed to horizontal gene transfer (HGT), but instead is likely to involve duplication, diversification and differential gene loss (DDL). The role of duplication in the origin of lineage-specific genes is further underlined by the discovery of genomic islands that seem to function as designated "gene dumps" and, perhaps, simultaneously, as "gene factories".


Subject(s)
Aspergillus fumigatus/genetics , Genomic Islands , Allergens/genetics , Aspergillus/classification , Aspergillus/genetics , Aspergillus/physiology , Aspergillus fumigatus/classification , Aspergillus fumigatus/pathogenicity , Aspergillus fumigatus/physiology , Chromosomes, Fungal/genetics , Eurotiales/classification , Eurotiales/genetics , Eurotiales/physiology , Evolution, Molecular , Fungal Proteins/genetics , Fungal Proteins/immunology , Genome, Fungal , Humans , Phylogeny , Species Specificity , Virulence/genetics
18.
Mol Plant Microbe Interact ; 19(11): 1193-206, 2006 Nov.
Article in English | MEDLINE | ID: mdl-17073302

ABSTRACT

The ability of Pseudomonas syringae pv. phaseolicola to cause halo blight of bean is dependent on its ability to translocate effector proteins into host cells via the hypersensitive response and pathogenicity (Hrp) type III secretion system (T3SS). To identify genes encoding type III effectors and other potential virulence factors that are regulated by the HrpL alternative sigma factor, we used a hidden Markov model, weight matrix model, and type III targeting-associated patterns to search the genome of P. syringae pv. phaseolicola 1448A, which recently was sequenced to completion. We identified 44 high-probability putative Hrp promoters upstream of genes encoding the core T3SS machinery, 27 candidate effectors and related T3SS substrates, and 10 factors unrelated to the Hrp system. The expression of 13 of these candidate HrpL regulon genes was analyzed by real-time polymerase chain reaction, and all were found to be upregulated by HrpL. Six of the candidate type III effectors were assayed for T3SS-dependent translocation into plant cells using the Bordetella pertussis calmodulin-dependent adenylate cyclase (Cya) translocation reporter, and all were translocated. PSPPH1855 (ApbE-family protein) and PSPPH3759 (alcohol dehydrogenase) have no apparent T3SS-related function; however, they do have homologs in the model strain P. syringae pv. tomato DC3000 (PSPTO2105 and PSPTO0834, respectively) that are similarly upregulated by HrpL. Mutations were constructed in the DC3000 homologs and found to reduce bacterial growth in host Arabidopsis leaves. These results establish the utility of the bioinformatic or candidate gene approach to identifying effectors and other genes relevant to pathogenesis in P. syringae genomes.


Subject(s)
Bacterial Proteins/genetics , DNA-Binding Proteins/genetics , Gene Expression Regulation, Bacterial , Pseudomonas syringae/genetics , Regulon , Sigma Factor/genetics , Adenylyl Cyclases/genetics , Arabidopsis , Computational Biology/methods , Genes, Reporter , Markov Chains , Mutation , Polymerase Chain Reaction , Promoter Regions, Genetic , Pseudomonas syringae/pathogenicity , Nicotiana , Translocation, Genetic , Virulence/genetics
19.
Mol Plant Microbe Interact ; 19(7): 789-800, 2006 Jul.
Article in English | MEDLINE | ID: mdl-16838791

ABSTRACT

Many plant pathogens suppress antimicrobial defenses using virulence factors that modulate endogenous host defenses. The Pseudomonas syringae phytotoxin coronatine (COR) is believed to promote virulence by acting as a jasmonate analog, because COR-insensitive 1 (coil) Arabidopsis thaliana and tomato mutants are impaired in jasmonate signaling and exhibit reduced susceptibility to P. syringae. To further investigate the role of jasmonate signaling in disease development, we analyzed several jasmonate-insensitive A. thaliana mutants for susceptibility to P. syringae pv. tomato strain DC3000 and sensitivity to COR. Jasmonate-insensitive 1 (jin1) mutants exhibit both reduced susceptibility to P. syringae pv. tomato DC3000 and reduced sensitivity to COR, whereas jasmonate-resistant 1 (jar1) plants exhibit wild-type responses to both COR and P. syringae pv. tomato DC3000. A jin1 jar1 double mutant does not exhibit enhanced jasmonate insensitivity, suggesting that JIN1 functions downstream of jasmonic acid-amino acid conjugates synthesized by JAR1. Reduced disease susceptibility in jin1 mutants is correlated with elevated expression of pathogenesis-related 1 (PR-1) and is dependent on accumulation of salicylic acid (SA). We also show that JIN1 is required for normal P. syringae pv. tomato DC3000 symptom development through an SA-independent mechanism. Thus, P. syringae pv. tomato DC3000 appears to utilize COR to manipulate JIN1-dependent jasmonate signaling both to suppress SA-mediated defenses and to promote symptom development.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis Proteins/metabolism , Arabidopsis/genetics , Arabidopsis/metabolism , Basic Helix-Loop-Helix Leucine Zipper Transcription Factors/genetics , Basic Helix-Loop-Helix Leucine Zipper Transcription Factors/metabolism , Plant Diseases/microbiology , Pseudomonas syringae/physiology , Salicylic Acid/metabolism , Amino Acids/pharmacology , Arabidopsis/drug effects , Arabidopsis/microbiology , Cyclopentanes/metabolism , Gene Expression Regulation, Plant , Indenes/pharmacology , Mutation , Nucleotidyltransferases/genetics , Nucleotidyltransferases/metabolism , Oxylipins , Plant Diseases/genetics , Plant Roots/drug effects , Plant Roots/growth & development , Plant Roots/metabolism , Signal Transduction
20.
J Biol Chem ; 280(48): 40271-81, 2005 Dec 02.
Article in English | MEDLINE | ID: mdl-16192267

ABSTRACT

Cyanobacterium Nostoc commune can tolerate the simultaneous stresses of desiccation, UV irradiation, and oxidation. Acidic WspA, of approximately 33.6 kDa, is secreted to the three-dimensional extracellular matrix and accounts for greater than 70% of the total soluble protein. The wspA gene of N. commune strain DRH1 was cloned and found in a single genomic copy, in a monocistronic operon. Transcription of wspA and sodF (superoxide dismutase), and synthesis and secretion of WspA, were induced upon desiccation or UV-A/B irradiation of cells. Recombinant WspA binds the UV-A/B absorbing pigment scytonemin through non-covalent interactions. WspA peptide polymorphism, and heterogeneity of multiple wspA sequences within cells of a single colony, account for distinct WspA isoforms. WspA has no similarity to entries in the sequence databases and wspA, a possible xenolog, is restricted to a subset of strains in the "form species" N. commune characterized through group I intron phylogeny. We hypothesize that WspA plays a central role in the global stress response of N. commune through modulation of the structure and function of the three-dimensional extracellular matrix, particularly the transport, distribution, and/or macromolecular architecture of mycosporine and scytonemin UV-A/B absorbing pigment complexes.


Subject(s)
Extracellular Matrix/metabolism , Nostoc commune/metabolism , Amino Acid Sequence , Base Sequence , Chromatography, Liquid , Heat-Shock Proteins/metabolism , Indoles/chemistry , Indoles/metabolism , Isoelectric Focusing , Mass Spectrometry , Models, Chemical , Models, Genetic , Molecular Sequence Data , Open Reading Frames , Oxygen/chemistry , Penicillanic Acid/analogs & derivatives , Penicillanic Acid/chemistry , Peptides/chemistry , Phenols/chemistry , Phenols/metabolism , Phylogeny , Polymorphism, Genetic , Polysaccharides/chemistry , Promoter Regions, Genetic , Protein Binding , Protein Structure, Tertiary , Recombinant Proteins/chemistry , Superoxide Dismutase/metabolism , Transcription, Genetic , Ultraviolet Rays
SELECTION OF CITATIONS
SEARCH DETAIL
...