Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 19 de 19
Filter
Add more filters










Publication year range
2.
G3 (Bethesda) ; 9(5): 1633-1646, 2019 05 07.
Article in English | MEDLINE | ID: mdl-30914426

ABSTRACT

Genomic sequence data for non-model organisms are increasingly available requiring the development of efficient and reproducible workflows. Here, we develop the first genomic resources and reproducible workflows for two threatened members of the reef-building coral genus Acropora We generated genomic sequence data from multiple samples of the Caribbean A. cervicornis (staghorn coral) and A. palmata (elkhorn coral), and predicted millions of nucleotide variants among these two species and the Pacific A. digitifera A subset of predicted nucleotide variants were verified using restriction length polymorphism assays and proved useful in distinguishing the two Caribbean acroporids and the hybrid they form ("A. prolifera"). Nucleotide variants are freely available from the Galaxy server (usegalaxy.org), and can be analyzed there with computational tools and stored workflows that require only an internet browser. We describe these data and some of the analysis tools, concentrating on fixed differences between A. cervicornis and A. palmata In particular, we found that fixed amino acid differences between these two species were enriched in proteins associated with development, cellular stress response, and the host's interactions with associated microbes, for instance in the ABC transporters and superoxide dismutase. Identified candidate genes may underlie functional differences in how these threatened species respond to changing environments. Users can expand the presented analyses easily by adding genomic data from additional species, as they become available.


Subject(s)
Anthozoa/genetics , Endangered Species , Genetic Variation , Genome , Genomics , Animals , Anthozoa/classification , Evolution, Molecular , Genetics, Population , Genomics/methods , Geography , INDEL Mutation , Phylogeny , Polymorphism, Restriction Fragment Length , Polymorphism, Single Nucleotide
3.
Genome Announc ; 5(26)2017 Jun 29.
Article in English | MEDLINE | ID: mdl-28663291

ABSTRACT

The draft genome of the Gram-positive spore-forming Anoxybacillus ayderensis strain MT-Cab (Firmicutes), isolated from an enrichment culture of Chloracidobacterium thermophilum, was sequenced and comprises 2,577,015 bp in 92 contigs. The draft genome is predicted to consist of 2,699 protein-coding genes, 73 tRNA-coding genes, and an estimated 8 rRNA operons.

4.
Genome Announc ; 3(2)2015 Mar 26.
Article in English | MEDLINE | ID: mdl-25814606

ABSTRACT

The draft genome sequence of the Deinococcus-Thermus group bacterium Meiothermus ruber strain A, isolated from a cyanobacterial enrichment culture obtained from Octopus Spring (Yellowstone National Park, WY), comprises 2,968,099 bp in 170 contigs. It is predicted to contain 2,895 protein-coding genes, 44 tRNA-coding genes, and 2 rRNA operons.

5.
Genome Announc ; 2(5)2014 Sep 04.
Article in English | MEDLINE | ID: mdl-25189583

ABSTRACT

The draft genome sequence of the thermophilic filamentous anoxygenic phototrophic bacterium Chloroflexus sp. strain MS-G (Chloroflexi), isolated from Mushroom Spring (Yellowstone National Park, WY, USA) was sequenced and comprises 4,784,183 bp in 251 contigs. The draft genome is predicted to encode 4,059 protein coding genes, 49 tRNA encoding genes, and 3 rRNA operons.

6.
Genome Announc ; 2(4)2014 Aug 28.
Article in English | MEDLINE | ID: mdl-25169864

ABSTRACT

The draft genome sequence of the moderately thermophilic bacterium Schleiferia thermophila strain Yellowstone (Bacteroidetes), isolated from Octopus Spring (Yellowstone National Park, WY, USA) was sequenced and comprises 2,617,694 bp in 35 contigs. The draft genome is predicted to encode 2,457 protein coding genes and 37 tRNA encoding genes and two rRNA operons.

7.
Proc Natl Acad Sci U S A ; 110(15): 5823-8, 2013 Apr 09.
Article in English | MEDLINE | ID: mdl-23530231

ABSTRACT

We performed a population genomics study of the aye-aye, a highly specialized nocturnal lemur from Madagascar. Aye-ayes have low population densities and extensive range requirements that could make this flagship species particularly susceptible to extinction. Therefore, knowledge of genetic diversity and differentiation among aye-aye populations is critical for conservation planning. Such information may also advance our general understanding of Malagasy biogeography, as aye-ayes have the largest species distribution of any lemur. We generated and analyzed whole-genome sequence data for 12 aye-ayes from three regions of Madagascar (North, West, and East). We found that the North population is genetically distinct, with strong differentiation from other aye-ayes over relatively short geographic distances. For comparison, the average FST value between the North and East aye-aye populations--separated by only 248 km--is over 2.1-times greater than that observed between human Africans and Europeans. This finding is consistent with prior watershed- and climate-based hypotheses of a center of endemism in northern Madagascar. Taken together, these results suggest a strong and long-term biogeographical barrier to gene flow. Thus, the specific attention that should be directed toward preserving large, contiguous aye-aye habitats in northern Madagascar may also benefit the conservation of other distinct taxonomic units. To help facilitate future ecological- and conservation-motivated population genomic analyses by noncomputational biologists, the analytical toolkit used in this study is available on the Galaxy Web site.


Subject(s)
Genetics, Population , Genomics , Lemur/genetics , Lemur/physiology , Animals , Evolution, Molecular , Genome , Genotype , Geography , Internet , Madagascar , Phylogeny , Polymorphism, Single Nucleotide , Sequence Analysis, DNA , Time Factors
8.
Gigascience ; 2(1): 17, 2013 Dec 30.
Article in English | MEDLINE | ID: mdl-24377391

ABSTRACT

BACKGROUND: Intra-species genetic variation can be used to investigate population structure, selection, and gene flow in non-model vertebrates; and due to the plummeting costs for genome sequencing, it is now possible for small labs to obtain full-genome variation data from their species of interest. However, those labs may not have easy access to, and familiarity with, computational tools to analyze those data. RESULTS: We have created a suite of tools for the Galaxy web server aimed at handling nucleotide and amino-acid polymorphisms discovered by full-genome sequencing of several individuals of the same species, or using a SNP genotyping microarray. In addition to providing user-friendly tools, a main goal is to make published analyses reproducible. While most of the examples discussed in this paper deal with nuclear-genome diversity in non-human vertebrates, we also illustrate the application of the tools to fungal genomes, human biomedical data, and mitochondrial sequences. CONCLUSIONS: This project illustrates that a small group can design, implement, test, document, and distribute a Galaxy tool collection to meet the needs of a particular community of biologists.

9.
BMC Genomics ; 13: 440, 2012 Aug 31.
Article in English | MEDLINE | ID: mdl-22938532

ABSTRACT

BACKGROUND: With over 1.3 billion people, India is estimated to contain three times more genetic diversity than does Europe. Next-generation sequencing technologies have facilitated the understanding of diversity by enabling whole genome sequencing at greater speed and lower cost. While genomes from people of European and Asian descent have been sequenced, only recently has a single male genome from the Indian subcontinent been published at sufficient depth and coverage. In this study we have sequenced and analyzed the genome of a South Asian Indian female (SAIF) from the Indian state of Kerala. RESULTS: We identified over 3.4 million SNPs in this genome including over 89,873 private variations. Comparison of the SAIF genome with several published personal genomes revealed that this individual shared ~50% of the SNPs with each of these genomes. Analysis of the SAIF mitochondrial genome showed that it was closely related to the U1 haplogroup which has been previously observed in Kerala. We assessed the SAIF genome for SNPs with health and disease consequences and found that the individual was at a higher risk for multiple sclerosis and a few other diseases. In analyzing SNPs that modulate drug response, we found a variation that predicts a favorable response to metformin, a drug used to treat diabetes. SNPs predictive of adverse reaction to warfarin indicated that the SAIF individual is not at risk for bleeding if treated with typical doses of warfarin. In addition, we report the presence of several additional SNPs of medical relevance. CONCLUSIONS: This is the first study to report the complete whole genome sequence of a female from the state of Kerala in India. The availability of this complete genome and variants will further aid studies aimed at understanding genetic diversity, identifying clinically relevant changes and assessing disease burden in the Indian population.


Subject(s)
Asian People/genetics , Chromosome Mapping , Genome, Human , Genome, Mitochondrial , Polymorphism, Single Nucleotide , Anticoagulants/adverse effects , DNA Copy Number Variations , Diabetes Mellitus/genetics , Diabetes Mellitus/prevention & control , Female , Genetic Predisposition to Disease , Genetic Variation , Haplotypes , Hemorrhage/chemically induced , Hemorrhage/genetics , Hemorrhage/prevention & control , Humans , Hypoglycemic Agents/therapeutic use , India , Metformin/therapeutic use , Middle Aged , Multiple Sclerosis/genetics , Multiple Sclerosis/prevention & control , Sequence Analysis, DNA , Warfarin/adverse effects
10.
Curr Protoc Bioinformatics ; Chapter 15: 15.2.1-15.2.27, 2012 Sep.
Article in English | MEDLINE | ID: mdl-22948727

ABSTRACT

This unit focuses on some of the tools available on the public Galaxy server that are useful for exploring possible associations between human genetic variants and phenotypes. We trace step-by-step through an example illustrating several methods for examining a single full-coverage genome to look for single-nucleotide polymorphisms (SNPs) that are either known to be associated with disease or suspected to have impact for other reasons. It makes use of public genomic data, tools designed specifically for working with variants, and also some general tools for text manipulation and operations on genomic coordinates.


Subject(s)
Phenotype , Polymorphism, Single Nucleotide , Software , Genetic Variation , Genome, Human , Humans
11.
Proc Natl Acad Sci U S A ; 109(36): E2382-90, 2012 Sep 04.
Article in English | MEDLINE | ID: mdl-22826254

ABSTRACT

Polar bears (PBs) are superbly adapted to the extreme Arctic environment and have become emblematic of the threat to biodiversity from global climate change. Their divergence from the lower-latitude brown bear provides a textbook example of rapid evolution of distinct phenotypes. However, limited mitochondrial and nuclear DNA evidence conflicts in the timing of PB origin as well as placement of the species within versus sister to the brown bear lineage. We gathered extensive genomic sequence data from contemporary polar, brown, and American black bear samples, in addition to a 130,000- to 110,000-y old PB, to examine this problem from a genome-wide perspective. Nuclear DNA markers reflect a species tree consistent with expectation, showing polar and brown bears to be sister species. However, for the enigmatic brown bears native to Alaska's Alexander Archipelago, we estimate that not only their mitochondrial genome, but also 5-10% of their nuclear genome, is most closely related to PBs, indicating ancient admixture between the two species. Explicit admixture analyses are consistent with ancient splits among PBs, brown bears and black bears that were later followed by occasional admixture. We also provide paleodemographic estimates that suggest bear evolution has tracked key climate events, and that PB in particular experienced a prolonged and dramatic decline in its effective population size during the last ca. 500,000 years. We demonstrate that brown bears and PBs have had sufficiently independent evolutionary histories over the last 4-5 million years to leave imprints in the PB nuclear genome that likely are associated with ecological adaptation to the Arctic environment.


Subject(s)
Adaptation, Biological/genetics , Climate Change/history , Evolution, Molecular , Genetics, Population , Genome/genetics , Ursidae/genetics , Animals , Arctic Regions , Base Sequence , Genetic Markers/genetics , History, Ancient , Molecular Sequence Data , Population Density , Population Dynamics , Sequence Analysis, DNA , Species Specificity
12.
Nature ; 469(7331): 529-33, 2011 Jan 27.
Article in English | MEDLINE | ID: mdl-21270892

ABSTRACT

'Orang-utan' is derived from a Malay term meaning 'man of the forest' and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (N(e)) expanded exponentially relative to the ancestral N(e) after the split, while Bornean N(e) declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.


Subject(s)
Genetic Variation , Genome/genetics , Pongo abelii/genetics , Pongo pygmaeus/genetics , Animals , Centromere/genetics , Cerebrosides/metabolism , Chromosomes , Evolution, Molecular , Female , Gene Rearrangement/genetics , Genetic Speciation , Genetics, Population , Humans , Male , Phylogeny , Population Density , Population Dynamics , Species Specificity
13.
Curr Protoc Bioinformatics ; Chapter 10: 10.4.1-10.4.14, 2010 Jun.
Article in English | MEDLINE | ID: mdl-20521245

ABSTRACT

The MultiPipMaker World Wide Web server (http://www.bx.psu.edu) provides a tool for aligning multiple DNA sequences and visualizing regions of conservation among them. This unit describes its use and gives an explanation of the resulting output files and supporting tools. Features provided by the server include alignment of up to 20 very long genomic sequences, output choices of a true, nucleotide-level multiple alignment and/or stacked, pairwise percent identity plots, and support for user-specified annotations of genomic features and arbitrary regions, with clickable links to additional information. Input sequences other than the reference can be fragmented, unordered, and unoriented.


Subject(s)
Computational Biology/methods , Internet , Sequence Alignment/methods , Software , Base Sequence , Genome/genetics , Guidelines as Topic
14.
Nature ; 463(7283): 943-7, 2010 Feb 18.
Article in English | MEDLINE | ID: mdl-20164927

ABSTRACT

The genetic structure of the indigenous hunter-gatherer peoples of southern Africa, the oldest known lineage of modern human, is important for understanding human diversity. Studies based on mitochondrial and small sets of nuclear markers have shown that these hunter-gatherers, known as Khoisan, San, or Bushmen, are genetically divergent from other humans. However, until now, fully sequenced human genomes have been limited to recently diverged populations. Here we present the complete genome sequences of an indigenous hunter-gatherer from the Kalahari Desert and a Bantu from southern Africa, as well as protein-coding regions from an additional three hunter-gatherers from disparate regions of the Kalahari. We characterize the extent of whole-genome and exome diversity among the five men, reporting 1.3 million novel DNA differences genome-wide, including 13,146 novel amino acid variants. In terms of nucleotide substitutions, the Bushmen seem to be, on average, more different from each other than, for example, a European and an Asian. Observed genomic differences between the hunter-gatherers and others may help to pinpoint genetic adaptations to an agricultural lifestyle. Adding the described variants to current databases will facilitate inclusion of southern Africans in medical research efforts, particularly when family and medical histories can be correlated with genome-wide data.


Subject(s)
Black People/genetics , Ethnicity/genetics , Genome, Human/genetics , Asian People/genetics , Exons/genetics , Genetics, Medical , Humans , Phylogeny , Polymorphism, Single Nucleotide/genetics , South Africa/ethnology , White People/genetics
15.
Genome Res ; 17(12): 1797-808, 2007 Dec.
Article in English | MEDLINE | ID: mdl-17984227

ABSTRACT

This article describes a set of alignments of 28 vertebrate genome sequences that is provided by the UCSC Genome Browser. The alignments can be viewed on the Human Genome Browser (March 2006 assembly) at http://genome.ucsc.edu, downloaded in bulk by anonymous FTP from http://hgdownload.cse.ucsc.edu/goldenPath/hg18/multiz28way, or analyzed with the Galaxy server at http://g2.bx.psu.edu. This article illustrates the power of this resource for exploring vertebrate and mammalian evolution, using three examples. First, we present several vignettes involving insertions and deletions within protein-coding regions, including a look at some human-specific indels. Then we study the extent to which start codons and stop codons in the human sequence are conserved in other species, showing that start codons are in general more poorly conserved than stop codons. Finally, an investigation of the phylogenetic depth of conservation for several classes of functional elements in the human genome reveals striking differences in the rates and modes of decay in alignability. Each functional class has a distinctive period of stringent constraint, followed by decays that allow (for the case of regulatory regions) or reject (for coding regions and ultraconserved elements) insertions and deletions.


Subject(s)
Conserved Sequence , Databases, Genetic , Sequence Alignment/methods , Animals , Base Sequence , Cats , Cattle , Codon, Initiator/genetics , Codon, Terminator/genetics , Dogs , Genome, Human , Guinea Pigs , Humans , Mice , Molecular Sequence Data , Mutagenesis, Insertional , Rabbits , Rats , Sequence Deletion
16.
Genome Res ; 16(12): 1557-65, 2006 Dec.
Article in English | MEDLINE | ID: mdl-16983148

ABSTRACT

This article analyzes mammalian genome rearrangements at higher resolution than has been published to date. We identify 3171 intervals, covering approximately 92% of the human genome, within which we find no rearrangements larger than 50 kilobases (kb) in the lineages leading to human, mouse, rat, and dog from their most recent common ancestor. Combining intervals that are adjacent in all contemporary species produces 1338 segments that may contain large insertions or deletions but that are free of chromosome fissions or fusions as well as inversions or translocations >50 kb in length. We describe a new method for predicting the ancestral order and orientation of those intervals from their observed adjacencies in modern species. We combine the results from this method with data from chromosome painting experiments to produce a map of an early mammalian genome that accounts for 96.8% of the available human genome sequence data. The precision is further increased by mapping inversions as small as 31 bp. Analysis of the predicted evolutionary breakpoints in the human lineage confirms certain published observations but disagrees with others. Although only a few mammalian genomes are currently sequenced to high precision, our theoretical analyses and computer simulations indicate that our results are reasonably accurate and that they will become highly accurate in the foreseeable future. Our methods were developed as part of a project to reconstruct the genome sequence of the last ancestor of human, dogs, and most other placental mammals.


Subject(s)
Evolution, Molecular , Genome, Human , Genome , Algorithms , Animals , Base Composition , Base Pairing , Chromosome Breakage , Chromosome Inversion , Chromosome Mapping , Chromosome Painting , Chromosomes , Computer Simulation , Dogs , Gene Deletion , Gene Rearrangement , Humans , Mice , Models, Genetic , Rats , Sequence Alignment/methods , Sequence Homology, Nucleic Acid
17.
Genome Res ; 15(10): 1451-5, 2005 Oct.
Article in English | MEDLINE | ID: mdl-16169926

ABSTRACT

Accessing and analyzing the exponentially expanding genomic sequence and functional data pose a challenge for biomedical researchers. Here we describe an interactive system, Galaxy, that combines the power of existing genome annotation databases with a simple Web portal to enable users to search remote resources, combine data from independent queries, and visualize the results. The heart of Galaxy is a flexible history system that stores the queries from each user; performs operations such as intersections, unions, and subtractions; and links to other computational tools. Galaxy can be accessed at http://g2.bx.psu.edu.


Subject(s)
Databases, Genetic , Genome , Biological Evolution , Internet , Promoter Regions, Genetic
18.
Curr Protoc Bioinformatics ; Chapter 10: Unit10.4, 2005 Apr.
Article in English | MEDLINE | ID: mdl-18428743

ABSTRACT

The MultiPipMaker World Wide Web server (http://www.bx.psu.edu) provides a useful tool for aligning multiple sequences and visualizing regions of conservation between them. This unit describes the use of the MultiPipMaker server and gives an explanation of the resulting output files and supporting tools. Features provided by the server include alignment of up to 20 very long genomic sequences, output choices of a true, nucleotide-level multiple alignment or stacked, pairwise percent identity plots, and user-specified annotations for genomic features and elements of choice, with clickable links to additional information. Alignments can include unordered, unoriented secondary sequences.


Subject(s)
Algorithms , Internet , Sequence Alignment/methods , Sequence Analysis, DNA/methods , Software , User-Computer Interface , Base Sequence , Computer Graphics , Molecular Sequence Data
19.
Nucleic Acids Res ; 33(Database issue): D466-70, 2005 Jan 01.
Article in English | MEDLINE | ID: mdl-15608239

ABSTRACT

We describe improvements to two databases that give access to information on genomic sequence similarities, functional elements in DNA and experimental results that demonstrate those functions. GALA, the database of Genome ALignments and Annotations, is now a set of interlinked relational databases for five vertebrate species, human, chimpanzee, mouse, rat and chicken. For each species, GALA records pairwise and multiple sequence alignments, scores derived from those alignments that reflect the likelihood of being under purifying selection or being a regulatory element, and extensive annotations such as genes, gene expression patterns and transcription factor binding sites. The user interface supports simple and complex queries, including operations such as subtraction and intersections as well as clustering and finding elements in proximity to features. dbERGE II, the database of Experimental Results on Gene Expression, contains experimental data from a variety of functional assays. Both databases are now run on the DB2 database management system. Improved hardware and tuning has reduced response times and increased querying capacity, while simplified query interfaces will help direct new users through the querying process. Links are available at http://www.bx.psu.edu/.


Subject(s)
Databases, Nucleic Acid , Genomics , Sequence Alignment , Animals , Chickens/genetics , Database Management Systems , Humans , Mice , Pan troglodytes/genetics , Rats , User-Computer Interface
SELECTION OF CITATIONS
SEARCH DETAIL
...