Abstract
The HUGO Gene Nomenclature Committee (HGNC) assigns unique symbols and names to human genes. The HGNC database (www.genenames.org) currently contains over 43 000 approved gene symbols, over 19 200 of which are assigned to protein-coding genes, 14 000 to pseudogenes and nearly 9000 to non-coding RNA genes. The public website, www.genenames.org, displays all approved nomenclature within Symbol Reports that contain data curated by HGNC nomenclature advisors and links to related genomic, clinical, and proteomic information. Here, we describe updates to our resource, including improvements to our search facility and new download features.
Subjects
Databases, Genetic; Humans; Genome; Genomics; Proteomics; Pseudogenes; Terminology as Topic
Abstract
The use of approved nomenclature in publications is vital to enable effective scientific communication and is particularly crucial when discussing genes of clinical relevance. Here, we discuss several examples of cases where the failure of researchers to use a HUGO Gene Nomenclature Committee (HGNC)-approved symbol in publications has led to confusion between unrelated human genes in the literature. We also inform authors of the steps they can take to ensure that they use approved nomenclature in their manuscripts and discuss how referencing HGNC IDs can remove ambiguity when referring to genes that have previously been published with confusing alias symbols.
Subjects
Databases, Genetic/standards; Genes/genetics; Genome, Human; Research Personnel/standards; Terminology as Topic; Genomics; Humans
Abstract
The Orthology Benchmark Service (https://orthology.benchmarkservice.org) is the gold standard for orthology inference evaluation, supported and maintained by the Quest for Orthologs Consortium. It is an essential resource for comparing existing and new methods of orthology inference (the bedrock of many comparative genomics and phylogenetic analyses) over a standard dataset and through common procedures. The Quest for Orthologs Consortium is dedicated to keeping the resource up to date, through regular updates of the Reference Proteomes and increasingly accessible data through the OpenEBench platform. For this update, we have added a new benchmark based on curated orthology assertions from the Vertebrate Gene Nomenclature Committee, and provided an example meta-analysis of the public predictions present on the platform.
Subjects
Benchmarking; Genomics; Phylogeny; Genomics/methods; Proteome
Abstract
Multiple resources currently exist that predict orthologous relationships between genes. These resources differ both in the methodologies used and in the species they make predictions for. The HGNC Comparison of Orthology Predictions (HCOP) search tool integrates and displays data from multiple ortholog prediction resources for a specified human gene or set of genes. An indication of the reliability of a prediction is provided by the number of resources that support it. HCOP was originally designed to show orthology predictions between human and mouse but has been expanded to include data from a current total of 20 selected vertebrate and model organism species. The HCOP pipeline used to fetch and integrate the information from the disparate ortholog and nomenclature data resources has recently been rewritten, both to enable the inclusion of new data and to take advantage of modern web technologies. Data from HCOP are used extensively in our work naming genes as the Vertebrate Gene Nomenclature Committee (https://vertebrate.genenames.org).
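The idea that a prediction's reliability is indicated by the number of resources supporting it can be sketched as follows. This is a minimal illustration, not the HCOP pipeline itself: the function name, the tuple layout, and the resource and gene names are all hypothetical placeholders.

```python
from collections import defaultdict


def rank_by_support(predictions):
    """Rank (human_gene, ortholog) pairs by how many resources predict them.

    `predictions` is an iterable of (resource, human_gene, ortholog) tuples.
    This layout is an assumption made for the example, not an HCOP format.
    """
    support = defaultdict(set)
    for resource, human_gene, ortholog in predictions:
        # A set ensures a resource is counted once per pair.
        support[(human_gene, ortholog)].add(resource)
    # Most-supported pairs first; ties are broken alphabetically by pair.
    return sorted(
        ((pair, sorted(resources)) for pair, resources in support.items()),
        key=lambda item: (-len(item[1]), item[0]),
    )


# Hypothetical input: placeholder resources and gene symbols.
example = [
    ("resource_A", "GENE1", "Gene1"),
    ("resource_B", "GENE1", "Gene1"),
    ("resource_A", "GENE1", "Gene1"),   # duplicate row, counted once
    ("resource_C", "GENE1", "Gene1x"),
]

for (human_gene, ortholog), resources in rank_by_support(example):
    print(human_gene, ortholog, len(resources), resources)
```

Counting distinct resources rather than rows keeps a resource that reports the same pair twice from inflating the support for that pair.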
Subjects
Computational Biology/methods; Genomics/methods; Sequence Homology; Software; Animals; Databases, Genetic; Humans; Vertebrates; Web Browser; Workflow
Abstract
The HUGO Gene Nomenclature Committee (HGNC) has been providing standardized symbols and names for human genes since the late 1970s. As funding agencies change their priorities, finding financial support for critical biomedical resources such as the HGNC becomes ever more challenging. In this article, we outline the key roles the HGNC currently plays in aiding communication and the need for these activities to be maintained.
Subjects
Databases, Genetic; Genomics; Humans
Abstract
The HUGO Gene Nomenclature Committee (HGNC), based at EMBL's European Bioinformatics Institute (EMBL-EBI), assigns unique symbols and names to human genes. There are over 42 000 approved gene symbols in our current database, of which over 19 000 are for protein-coding genes. While we still update placeholder and problematic symbols, we are working towards stabilizing symbols where possible; over 2000 symbols for disease-associated genes are now marked as stable in our symbol reports. All of our data is available at the HGNC website https://www.genenames.org. The Vertebrate Gene Nomenclature Committee (VGNC) was established to assign standardized nomenclature in line with human for vertebrate species lacking their own nomenclature committee. In addition to the previous VGNC core species of chimpanzee, cow, horse and dog, we now name genes in cat, macaque and pig. Gene groups have been added to VGNC and currently include two complex families: olfactory receptors (ORs) and cytochrome P450s (CYPs). In collaboration with specialists we have also named CYPs in species beyond our core set. All VGNC data is available at https://vertebrate.genenames.org/. This article provides an overview of our online data and resources, focusing on updates over the last two years.
Subjects
Computational Biology/methods; Databases, Genetic; Genes/genetics; Genomics/methods; Terminology as Topic; Vertebrates/genetics; Animals; Humans; Internet; Proteins/genetics; Species Specificity; User-Computer Interface; Vertebrates/classification
Abstract
BACKGROUND: Olfactory receptors (ORs) are G protein-coupled receptors with a crucial role in odor detection. A typical mammalian genome harbors ~1000 OR genes and pseudogenes; however, different gene duplication/deletion events have occurred in each species, resulting in complex orthology relationships. While the human OR nomenclature is widely accepted and based on phylogenetic classification into 18 families and further into subfamilies, for other mammals different and multiple nomenclature systems are currently in use, thus concealing important evolutionary and functional insights. RESULTS: Here, we describe the Mutual Maximum Similarity (MMS) algorithm, a systematic classifier for assigning a human-centric nomenclature to any OR gene based on inter-species hierarchical pairwise similarities. MMS was applied to the OR repertoires of seven mammals and zebrafish. Altogether, we assigned symbols to 10,249 ORs. This nomenclature is supported by both phylogenetic and synteny analyses. The availability of a unified nomenclature provides a framework for diverse studies, where textual symbol comparison allows immediate identification of potential ortholog groups as well as species-specific expansions/deletions; for example, Or52e5 and Or52e5b represent a rat-specific duplication of OR52E5. Another example is the complete absence of OR subfamily OR6Z among primate OR symbols. In other mammals, OR6Z members are located in one genomic cluster, suggesting a large deletion in the great ape lineage. An additional 14 mammalian OR subfamilies are missing from the primate genomes. While in chimpanzee 87% of the symbols were identical to human symbols, this number decreased to ~50% in dog and cow and to ~30% in rodents, reflecting the adaptive changes of the OR gene superfamily across diverse ecological niches. Application of the proposed nomenclature to zebrafish revealed similarity to mammalian ORs that could not be detected from the current zebrafish olfactory receptor gene nomenclature. CONCLUSIONS: We have consolidated a unified standard nomenclature system for the vertebrate OR superfamily. The new nomenclature system will be applied to cow, horse, dog and chimpanzee by the Vertebrate Gene Nomenclature Committee and its implementation is currently under consideration by other relevant species-specific nomenclature committees.
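The textual symbol comparison mentioned in the abstract can be illustrated with a short sketch. It is not the MMS algorithm. It only shows how, once symbols follow a shared scheme, string handling alone can flag candidate species-specific duplications. The symbol pattern (family number, subfamily letters, member number, optional trailing letters) is an assumption inferred from the Or52e5/Or52e5b example, and the function names are hypothetical; a real pipeline would also need to handle suffixes with other meanings, such as pseudogene markers.

```python
import re
from collections import defaultdict

# Assumed pattern: OR + family number + subfamily letters + member number,
# optionally followed by letters marking an extra copy (e.g. the "b" in Or52e5b).
_OR_SYMBOL = re.compile(r"^(OR\d+[A-Z]+\d+)([A-Z]*)$")


def base_symbol(symbol):
    """Return the uppercased symbol with any trailing copy suffix removed."""
    upper = symbol.upper()
    match = _OR_SYMBOL.match(upper)
    return match.group(1) if match else upper


def candidate_expansions(species_symbols, human_symbols):
    """Map each human symbol to the species symbols sharing its base form,
    keeping only cases where the species has more than one such symbol."""
    human = set(human_symbols)
    groups = defaultdict(list)
    for symbol in species_symbols:
        base = base_symbol(symbol)
        if base in human:
            groups[base].append(symbol)
    return {base: members for base, members in groups.items() if len(members) > 1}


# Symbols taken from the example in the abstract above.
print(candidate_expansions(["Or52e5", "Or52e5b"], ["OR52E5"]))
```

The same grouping run in the other direction (human symbols with no species counterpart) would flag candidate deletions.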
Subjects
Algorithms; Receptors, Odorant; Terminology as Topic; Vertebrates; Animals; Cattle; Dogs; Genome; Horses; Humans; Pan troglodytes; Phylogeny; Rats; Receptors, Odorant/genetics; Species Specificity; Synteny; Vertebrates/genetics; Zebrafish
Abstract
The deutocerebral (second) head segment is putatively homologous across Arthropoda, in spite of remarkable disparity of form and function of deutocerebral appendages. In Mandibulata this segment bears a pair of sensory antennae, whereas in Chelicerata the same segment bears a pair of feeding appendages called chelicerae. Part of the evidence for the homology of deutocerebral appendages is the conserved function of homothorax (hth), which has been shown to specify antennal or cheliceral fate in the absence of Hox signaling, in both mandibulate and chelicerate exemplars. However, the genetic basis for the morphological disparity of antenna and chelicera is not understood. To test whether downstream targets of hth have diverged in a lineage-specific manner, we examined the evolution of the function and expression of spineless (ss), which in two holometabolous insects is known to act as a hth target and distal antennal determinant. Toward expanding phylogenetic representation of gene expression data, here we show that strong expression of ss is observed in developing antennae of a hemimetabolous insect, a centipede, and an amphipod crustacean. By contrast, ss orthologs are not expressed throughout the cheliceral limb buds of spiders or harvestmen during developmental stages when appendage fate is specified. RNA interference-mediated knockdown of ss in Oncopeltus fasciatus, which bears a simple plesiomorphic antenna, resulted in homeotic distal antenna-to-leg transformation, comparable to data from holometabolous insect counterparts. Knockdown of hth in Oncopeltus fasciatus abrogated ss expression, suggesting conservation of upstream regulation. These data suggest that ss may be a flagellar (distal antennal) determinant more broadly, and that this function was acquired at the base of Mandibulata.
Subjects
Arthropods/anatomy & histology; Arthropods/genetics; Head/anatomy & histology; Insect Proteins/genetics; Sequence Homology, Amino Acid; Amino Acid Sequence; Animals; Female; Gene Expression Regulation, Developmental; Insect Proteins/chemistry; Insect Proteins/metabolism; Likelihood Functions; Models, Biological; RNA Interference
Abstract
Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific life history.
Subjects
Arthropods/genetics; Genome; Synteny; Animals; Circadian Rhythm Signaling Peptides and Proteins/genetics; DNA Methylation; Evolution, Molecular; Female; Genome, Mitochondrial; Hormones/genetics; Male; Multigene Family; Phylogeny; Polymorphism, Genetic; Protein Kinases/genetics; RNA, Untranslated/genetics; Receptors, Odorant/genetics; Selenoproteins/genetics; Sex Chromosomes; Transcription Factors/genetics
Abstract
The vertebrate limb is one of the most intensively studied organs in the field of developmental biology. Limb development in tetrapod vertebrates is highly conserved and dependent on the interaction of several important molecular pathways. The bone morphogenetic protein (BMP) signaling cascade is one of these pathways and has been shown to be crucial for several aspects of limb development. Here, we have used a Xenopus laevis transgenic line, in which expression of the inhibitor Noggin is under the control of the heat-shock promoter hsp70, to examine the effects of attenuation of BMP signaling at different stages of limb development. Remarkably different phenotypes were produced at different stages, illustrating the varied roles of BMP in development of the limb. Very early limb buds appeared to be refractory to the effects of BMP attenuation, developing normally in most cases. Ectopic limbs were produced by overexpression of Noggin corresponding to a brief window of limb development at about stage 49/50, as recently described by Christen et al. (2012). Attenuation of BMP signaling in stage 51 or 52 tadpoles led to a reduction in the number of digits formed, resulting in hypodactyly or ectrodactyly, as well as occasional defects in the more proximal tibia-fibula. Finally, inhibition at stage 54 (paddle stage) led to the formation of dramatically shortened digits resulting from loss of distal phalanges. Transcriptome analysis has revealed the possibility that more Noggin-sensitive members of the BMP family could be involved in limb development than previously suspected. Our analysis demonstrates the usefulness of heat-shock-driven gene expression as an effective method for inhibiting a developmental pathway at different times during limb development.
Subjects
Amphibians/physiology; Bone Morphogenetic Proteins/physiology; Extremities/embryology; Xenopus laevis/physiology; Animals; Animals, Genetically Modified; Carrier Proteins/metabolism; Limb Buds/abnormalities; Limb Deformities, Congenital/veterinary; Xenopus Proteins/physiology
Abstract
The Vertebrate Gene Nomenclature Committee (VGNC) was established in 2016 as a sister project to the HUGO Gene Nomenclature Committee, to approve gene nomenclature in vertebrate species without an existing dedicated nomenclature committee. The VGNC aims to harmonize gene nomenclature across selected vertebrate species in line with human gene nomenclature, with orthologs assigned the same nomenclature where possible. This article presents an overview of the VGNC project and discussion of key findings resulting from this work to date. VGNC-approved nomenclature is accessible at https://vertebrate.genenames.org and is additionally displayed by the NCBI, Ensembl, and UniProt databases.
Subjects
Databases, Genetic; Vertebrates; Animals; Humans; Vertebrates/genetics
Abstract
BACKGROUND: The Hemiptera (aphids, cicadas, and true bugs) are a key insect order, with high diversity for feeding ecology and excellent experimental tractability for molecular genetics. Building upon recent sequencing of hemipteran pests such as phloem-feeding aphids and blood-feeding bed bugs, we present the genome sequence and comparative analyses centered on the milkweed bug Oncopeltus fasciatus, a seed feeder of the family Lygaeidae. RESULTS: The 926-Mb Oncopeltus genome is well represented by the current assembly and official gene set. We use our genomic and RNA-seq data not only to characterize the protein-coding gene repertoire and perform isoform-specific RNAi, but also to elucidate patterns of molecular evolution and physiology. We find ongoing, lineage-specific expansion and diversification of repressive C2H2 zinc finger proteins. The discovery of intron gain and turnover specific to the Hemiptera also prompted the evaluation of lineage and genome size as predictors of gene structure evolution. Furthermore, we identify enzymatic gains and losses that correlate with feeding biology, particularly for reductions associated with derived, fluid nutrition feeding. CONCLUSIONS: With the milkweed bug, we now have a critical mass of sequenced species for a hemimetabolous insect order and close outgroup to the Holometabola, substantially improving the diversity of insect genomics. We thereby define commonalities among the Hemiptera and delve into how hemipteran genomes reflect distinct feeding ecologies. Given Oncopeltus's strength as an experimental model, these new sequence resources bolster the foundation for molecular research and highlight technical considerations for the analysis of medium-sized invertebrate genomes.
Subjects
Evolution, Molecular; Genome, Insect; Hemiptera/genetics; Amino Acid Sequence; Animals; CYS2-HIS2 Zinc Fingers; Feeding Behavior; Gene Dosage; Gene Expression Profiling; Gene Transfer, Horizontal; Genes, Homeobox; Hemiptera/growth & development; Hemiptera/metabolism; Pigmentation/genetics; Smell; Transcription Factors/genetics
Abstract
Primordial germ cell (PGC) formation in holometabolous insects like Drosophila melanogaster relies on maternally synthesised germ cell determinants that are asymmetrically localised to the oocyte posterior cortex. Embryonic nuclei that inherit this "germ plasm" acquire PGC fate. In contrast, historical studies of basally branching insects (Hemimetabola) suggest that a maternal requirement for germ line genes in PGC specification may be a derived character confined principally to Holometabola. However, there have been remarkably few investigations of germ line gene expression and function in hemimetabolous insects. Here we characterise PGC formation in the milkweed bug Oncopeltus fasciatus, a member of the sister group to Holometabola, thus providing an important evolutionary comparison to members of this clade. We examine the transcript distribution of orthologues of 19 Drosophila germ cell and/or germ plasm marker genes, and show that none of them localise asymmetrically within Oncopeltus oocytes or early embryos. Using multiple molecular and cytological criteria, we provide evidence that PGCs form after cellularisation at the site of gastrulation. Functional studies of vasa and tudor reveal that these genes are not required for germ cell formation, but that vasa is required in adult males for spermatogenesis. Taken together, our results provide evidence that Oncopeltus germ cells may form in the absence of germ plasm, consistent with the hypothesis that germ plasm is a derived strategy of germ cell specification in insects.