RESUMO
The genetic structure of the indigenous hunter-gatherer peoples of southern Africa, the oldest known lineage of modern human, is important for understanding human diversity. Studies based on mitochondrial and small sets of nuclear markers have shown that these hunter-gatherers, known as Khoisan, San, or Bushmen, are genetically divergent from other humans. However, until now, fully sequenced human genomes have been limited to recently diverged populations. Here we present the complete genome sequences of an indigenous hunter-gatherer from the Kalahari Desert and a Bantu from southern Africa, as well as protein-coding regions from an additional three hunter-gatherers from disparate regions of the Kalahari. We characterize the extent of whole-genome and exome diversity among the five men, reporting 1.3 million novel DNA differences genome-wide, including 13,146 novel amino acid variants. In terms of nucleotide substitutions, the Bushmen seem to be, on average, more different from each other than, for example, a European and an Asian. Observed genomic differences between the hunter-gatherers and others may help to pinpoint genetic adaptations to an agricultural lifestyle. Adding the described variants to current databases will facilitate inclusion of southern Africans in medical research efforts, particularly when family and medical histories can be correlated with genome-wide data.
Assuntos
População Negra/genética , Etnicidade/genética , Genoma Humano/genética , Povo Asiático/genética , Éxons/genética , Genética Médica , Humanos , Filogenia , Polimorfismo de Nucleotídeo Único/genética , África do Sul/etnologia , População Branca/genéticaRESUMO
In 1994, two independent groups extracted DNA from several Pleistocene epoch mammoths and noted differences among individual specimens. Subsequently, DNA sequences have been published for a number of extinct species. However, such ancient DNA is often fragmented and damaged, and studies to date have typically focused on short mitochondrial sequences, never yielding more than a fraction of a per cent of any nuclear genome. Here we describe 4.17 billion bases (Gb) of sequence from several mammoth specimens, 3.3 billion (80%) of which are from the woolly mammoth (Mammuthus primigenius) genome and thus comprise an extensive set of genome-wide sequence from an extinct species. Our data support earlier reports that elephantid genomes exceed 4 Gb. The estimated divergence rate between mammoth and African elephant is half of that between human and chimpanzee. The observed number of nucleotide differences between two particular mammoths was approximately one-eighth of that between one of them and the African elephant, corresponding to a separation between the mammoths of 1.5-2.0 Myr. The estimated probability that orthologous elephant and mammoth amino acids differ is 0.002, corresponding to about one residue per protein. Differences were discovered between mammoth and African elephant in amino-acid positions that are otherwise invariant over several billion years of combined mammalian evolution. This study shows that nuclear genome sequencing of extinct species can reveal population differences not evident from the fossil record, and perhaps even discover genetic factors that affect extinction.
Assuntos
Núcleo Celular/genética , Elefantes/genética , Evolução Molecular , Extinção Biológica , Fósseis , Genoma/genética , Genômica , Análise de Sequência de DNA/métodos , África , Animais , Sequência Conservada/genética , Elefantes/anatomia & histologia , Feminino , Cabelo/metabolismo , Humanos , Índia , Masculino , FilogeniaRESUMO
Taking advantage of the complete genome sequences of several mammals, we developed a novel method to detect losses of well-established genes in the human genome through syntenic mapping of gene structures between the human, mouse, and dog genomes. Unlike most previous genomic methods for pseudogene identification, this analysis is able to differentiate losses of well-established genes from pseudogenes formed shortly after segmental duplication or generated via retrotransposition. Therefore, it enables us to find genes that were inactivated long after their birth, which were likely to have evolved nonredundant biological functions before being inactivated. The method was used to look for gene losses along the human lineage during the approximately 75 million years (My) since the common ancestor of primates and rodents (the euarchontoglire crown group). We identified 26 losses of well-established genes in the human genome that were all lost at least 50 My after their birth. Many of them were previously characterized pseudogenes in the human genome, such as GULO and UOX. Our methodology is highly effective at identifying losses of single-copy genes of ancient origin, allowing us to find a few well-known pseudogenes in the human genome missed by previous high-throughput genome-wide studies. In addition to confirming previously known gene losses, we identified 16 previously uncharacterized human pseudogenes that are definitive losses of long-established genes. Among them is ACYL3, an ancient enzyme present in archaea, bacteria, and eukaryotes, but lost approximately 6 to 8 Mya in the ancestor of humans and chimps. Although losses of well-established genes do not equate to adaptive gene losses, they are a useful proxy to use when searching for such genetic changes. This is especially true for adaptive losses that occurred more than 250,000 years ago, since any genetic evidence of the selective sweep indicative of such an event has been erased.
Assuntos
Evolução Biológica , Mapeamento Cromossômico/métodos , Análise Mutacional de DNA/métodos , Evolução Molecular , Deleção de Genes , Genoma Humano/genética , Pseudogenes/genética , Animais , Cães , Variação Genética/genética , Genômica/métodos , Humanos , CamundongosRESUMO
The University of California Santa Cruz (UCSC) Proteome Browser provides a wealth of protein information presented in graphical images and with links to other protein-related Internet sites. The Proteome Browser is tightly integrated with the UCSC Genome Browser. For the first time, Genome Browser users have both the genome and proteome worlds at their fingertips simultaneously. The Proteome Browser displays tracks of protein and genomic sequences, exon structure, polarity, hydrophobicity, locations of cysteine and glycosylation potential, Superfamily domains and amino acids that deviate from normal abundance. Histograms show genome-wide distribution of protein properties, including isoelectric point, molecular weight, number of exons, InterPro domains and cysteine locations, together with specific property values of the selected protein. The Proteome Browser also provides links to gene annotations in the Genome Browser, the Known Genes details page and the Gene Sorter; domain information from Superfamily, InterPro and Pfam; three-dimensional structures at the Protein Data Bank and ModBase; and pathway data at KEGG, BioCarta/CGAP and BioCyc. As of August 2004, the Proteome Browser is available for human, mouse and rat proteomes. The browser may be accessed from any Known Genes details page of the Genome Browser at http://genome.ucsc.edu. A user's guide is also available on this website.
Assuntos
Bases de Dados de Proteínas , Proteínas/química , Proteínas/genética , Proteômica , California , Genômica , Humanos , Integração de Sistemas , Interface Usuário-ComputadorRESUMO
PURPOSE: The transit temperature profiles, mean kinetic temperatures (MKTs), and stability of insulin samples in both insulated and noninsulated containers exposed to summer and winter temperatures were evaluated. METHODS: Regular insulin, isophane insulin human (NPH) insulin, and 70% isophane-30% regular (70/30) insulin were packaged in the most commonly dispensed quantity of four vials in noninsulated mailers and insulated containers with frozen gel packs. After packaging, sealed containers and mailers were placed in an environmental chamber for 24-120 hours and exposed to summer and winter transit conditions. Temperatures inside the environmental chamber were recorded every 15 minutes and maintained within 3 degrees C of the specified transit temperature. After exposure to the transit conditions, insulin cartons were removed from their packaging, visually inspected for changes in physical appearance, and stored at 4 degrees C until analysis. The MKT of each package was calculated. High-performance liquid chromatography was performed to determine sample stability, and size-exclusion chromatography was conducted to detect aggregate products of insulin. RESULTS: Regardless of shipping conditions or packaging, all samples met the United States Pharmacopeia's ( USP's) specified limits and retained product stability. Visual inspection of the physical appearance of insulin samples before and after temperature exposure revealed results similar to those described in the product inserts. Microscopic analysis of the injectable suspensions confirmed similar crystal morphologies before and after temperature exposure. CONCLUSION: Regular, NPH, and 70/30 insulin maintained potency within USP limits when stored in programmable environmental chambers simulating summer and winter overnight or three- to five-day ground delivery conditions, regardless of packaging material.
Assuntos
Armazenamento de Medicamentos , Insulina/química , Cromatografia em Gel , Cromatografia Líquida de Alta Pressão , Embalagem de Medicamentos/métodos , Estabilidade de Medicamentos , Ambiente Controlado , Temperatura , Meios de TransporteRESUMO
As vertebrate genome sequences near completion and research refocuses to their analysis, the issue of effective genome annotation display becomes critical. A mature web tool for rapid and reliable display of any requested portion of the genome at any scale, together with several dozen aligned annotation tracks, is provided at http://genome.ucsc.edu. This browser displays assembly contigs and gaps, mRNA and expressed sequence tag alignments, multiple gene predictions, cross-species homologies, single nucleotide polymorphisms, sequence-tagged sites, radiation hybrid data, transposon repeats, and more as a stack of coregistered tracks. Text and sequence-based searches provide quick and precise access to any region of specific interest. Secondary links from individual features lead to sequence details and supplementary off-site databases. One-half of the annotation tracks are computed at the University of California, Santa Cruz from publicly available sequence data; collaborators worldwide provide the rest. Users can stably add their own custom tracks to the browser for educational or research purposes. The conceptual and technical framework of the browser, its underlying MYSQL database, and overall use are described. The web site currently serves over 50,000 pages per day to over 3000 different users.