Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 54
Filter
1.
Nature ; 584(7820): 244-251, 2020 08.
Article in English | MEDLINE | ID: mdl-32728217

ABSTRACT

DNase I hypersensitive sites (DHSs) are generic markers of regulatory DNA1-5 and contain genetic variations associated with diseases and phenotypic traits6-8. We created high-resolution maps of DHSs from 733 human biosamples encompassing 438 cell and tissue types and states, and integrated these to delineate and numerically index approximately 3.6 million DHSs within the human genome sequence, providing a common coordinate system for regulatory DNA. Here we show that these maps highly resolve the cis-regulatory compartment of the human genome, which encodes unexpectedly diverse cell- and tissue-selective regulatory programs at very high density. These programs can be captured comprehensively by a simple vocabulary that enables the assignment to each DHS of a regulatory barcode that encapsulates its tissue manifestations, and global annotation of protein-coding and non-coding RNA genes in a manner orthogonal to gene expression. Finally, we show that sharply resolved DHSs markedly enhance the genetic association and heritability signals of diseases and traits. Rather than being confined to a small number of distal elements or promoters, we find that genetic signals converge on congruently regulated sets of DHSs that decorate entire gene bodies. Together, our results create a universal, extensible coordinate system and vocabulary for human regulatory DNA marked by DHSs, and provide a new global perspective on the architecture of human gene regulation.


Subject(s)
Chromatin/genetics , DNA/metabolism , Deoxyribonuclease I/metabolism , Molecular Sequence Annotation , Chromatin/chemistry , Chromatin/metabolism , DNA/chemistry , DNA/genetics , Gene Expression Regulation , Genes/genetics , Genome, Human/genetics , Humans , Promoter Regions, Genetic/genetics , Regulatory Sequences, Nucleic Acid/genetics
2.
Nature ; 583(7818): 729-736, 2020 07.
Article in English | MEDLINE | ID: mdl-32728250

ABSTRACT

Combinatorial binding of transcription factors to regulatory DNA underpins gene regulation in all organisms. Genetic variation in regulatory regions has been connected with diseases and diverse phenotypic traits1, but it remains challenging to distinguish variants that affect regulatory function2. Genomic DNase I footprinting enables the quantitative, nucleotide-resolution delineation of sites of transcription factor occupancy within native chromatin3-6. However, only a small fraction of such sites have been precisely resolved on the human genome sequence6. Here, to enable comprehensive mapping of transcription factor footprints, we produced high-density DNase I cleavage maps from 243 human cell and tissue types and states and integrated these data to delineate about 4.5 million compact genomic elements that encode transcription factor occupancy at nucleotide resolution. We map the fine-scale structure within about 1.6 million DNase I-hypersensitive sites and show that the overwhelming majority are populated by well-spaced sites of single transcription factor-DNA interaction. Cell-context-dependent cis-regulation is chiefly executed by wholesale modulation of accessibility at regulatory DNA rather than by differential transcription factor occupancy within accessible elements. We also show that the enrichment of genetic variants associated with diseases or phenotypic traits in regulatory regions1,7 is almost entirely attributable to variants within footprints, and that functional variants that affect transcription factor occupancy are nearly evenly partitioned between loss- and gain-of-function alleles. Unexpectedly, we find increased density of human genetic variation within transcription factor footprints, revealing an unappreciated driver of cis-regulatory evolution. Our results provide a framework for both global and nucleotide-precision analyses of gene regulatory mechanisms and functional genetic variation.


Subject(s)
DNA Footprinting/standards , Genome, Human/genetics , Transcription Factors/metabolism , Consensus Sequence , DNA/genetics , DNA/metabolism , Deoxyribonuclease I/metabolism , Genetics, Population , Genome-Wide Association Study , Humans , Models, Molecular , Polymorphism, Single Nucleotide , Regulatory Sequences, Nucleic Acid/genetics
3.
Nature ; 583(7818): 699-710, 2020 07.
Article in English | MEDLINE | ID: mdl-32728249

ABSTRACT

The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE1 and Roadmap Epigenomics2 data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.


Subject(s)
DNA/genetics , Databases, Genetic , Genome/genetics , Genomics , Molecular Sequence Annotation , Registries , Regulatory Sequences, Nucleic Acid/genetics , Animals , Chromatin/genetics , Chromatin/metabolism , DNA/chemistry , DNA Footprinting , DNA Methylation/genetics , DNA Replication Timing , Deoxyribonuclease I/metabolism , Genome, Human , Histones/metabolism , Humans , Mice , Mice, Transgenic , RNA-Binding Proteins/genetics , Transcription, Genetic/genetics , Transposases/metabolism
4.
Cell ; 143(5): 837-47, 2010 Nov 24.
Article in English | MEDLINE | ID: mdl-21111241

ABSTRACT

Understanding the prevailing mutational mechanisms responsible for human genome structural variation requires uniformity in the discovery of allelic variants and precision in terms of breakpoint delineation. We develop a resource based on capillary end sequencing of 13.8 million fosmid clones from 17 human genomes and characterize the complete sequence of 1054 large structural variants corresponding to 589 deletions, 384 insertions, and 81 inversions. We analyze the 2081 breakpoint junctions and infer potential mechanism of origin. Three mechanisms account for the bulk of germline structural variation: microhomology-mediated processes involving short (2-20 bp) stretches of sequence (28%), nonallelic homologous recombination (22%), and L1 retrotransposition (19%). The high quality and long-range continuity of the sequence reveals more complex mutational mechanisms, including repeat-mediated inversions and gene conversion, that are most often missed by other methods, such as comparative genomic hybridization, single nucleotide polymorphism microarrays, and next-generation sequencing.


Subject(s)
Genome, Human , Genomic Structural Variation , Mutation , Base Sequence , Gene Conversion , Humans , Molecular Sequence Data , Sequence Analysis, DNA
6.
Nature ; 518(7539): 317-30, 2015 Feb 19.
Article in English | MEDLINE | ID: mdl-25693563

ABSTRACT

The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.


Subject(s)
Epigenesis, Genetic/genetics , Epigenomics , Genome, Human/genetics , Base Sequence , Cell Lineage/genetics , Cells, Cultured , Chromatin/chemistry , Chromatin/genetics , Chromatin/metabolism , Chromosomes, Human/chemistry , Chromosomes, Human/genetics , Chromosomes, Human/metabolism , DNA/chemistry , DNA/genetics , DNA/metabolism , DNA Methylation , Datasets as Topic , Enhancer Elements, Genetic/genetics , Genetic Variation/genetics , Genome-Wide Association Study , Histones/metabolism , Humans , Organ Specificity/genetics , RNA/genetics , Reference Values
7.
Nature ; 515(7527): 365-70, 2014 Nov 20.
Article in English | MEDLINE | ID: mdl-25409825

ABSTRACT

The basic body plan and major physiological axes have been highly conserved during mammalian evolution, yet only a small fraction of the human genome sequence appears to be subject to evolutionary constraint. To quantify cis- versus trans-acting contributions to mammalian regulatory evolution, we performed genomic DNase I footprinting of the mouse genome across 25 cell and tissue types, collectively defining ∼8.6 million transcription factor (TF) occupancy sites at nucleotide resolution. Here we show that mouse TF footprints conjointly encode a regulatory lexicon that is ∼95% similar with that derived from human TF footprints. However, only ∼20% of mouse TF footprints have human orthologues. Despite substantial turnover of the cis-regulatory landscape, nearly half of all pairwise regulatory interactions connecting mouse TF genes have been maintained in orthologous human cell types through evolutionary innovation of TF recognition sequences. Furthermore, the higher-level organization of mouse TF-to-TF connections into cellular network architectures is nearly identical with human. Our results indicate that evolutionary selection on mammalian gene regulation is targeted chiefly at the level of trans-regulatory circuitry, enabling and potentiating cis-regulatory plasticity.


Subject(s)
Conserved Sequence/genetics , Evolution, Molecular , Mammals/genetics , Regulatory Sequences, Nucleic Acid/genetics , Transcription Factors/genetics , Transcription Factors/metabolism , Animals , DNA Footprinting , Gene Expression Regulation, Developmental/genetics , Gene Regulatory Networks/genetics , Humans , Mice
8.
Nature ; 515(7527): 355-64, 2014 Nov 20.
Article in English | MEDLINE | ID: mdl-25409824

ABSTRACT

The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.


Subject(s)
Genome/genetics , Genomics , Mice/genetics , Molecular Sequence Annotation , Animals , Cell Lineage/genetics , Chromatin/genetics , Chromatin/metabolism , Conserved Sequence/genetics , DNA Replication/genetics , Deoxyribonuclease I/metabolism , Gene Expression Regulation/genetics , Gene Regulatory Networks/genetics , Genome-Wide Association Study , Humans , RNA/genetics , Regulatory Sequences, Nucleic Acid/genetics , Species Specificity , Transcription Factors/metabolism , Transcriptome/genetics
9.
J Am Soc Nephrol ; 30(3): 421-441, 2019 Mar.
Article in English | MEDLINE | ID: mdl-30760496

ABSTRACT

BACKGROUND: Linking genetic risk loci identified by genome-wide association studies (GWAS) to their causal genes remains a major challenge. Disease-associated genetic variants are concentrated in regions containing regulatory DNA elements, such as promoters and enhancers. Although researchers have previously published DNA maps of these regulatory regions for kidney tubule cells and glomerular endothelial cells, maps for podocytes and mesangial cells have not been available. METHODS: We generated regulatory DNA maps (DNase-seq) and paired gene expression profiles (RNA-seq) from primary outgrowth cultures of human glomeruli that were composed mainly of podocytes and mesangial cells. We generated similar datasets from renal cortex cultures, to compare with those of the glomerular cultures. Because regulatory DNA elements can act on target genes across large genomic distances, we also generated a chromatin conformation map from freshly isolated human glomeruli. RESULTS: We identified thousands of unique regulatory DNA elements, many located close to transcription factor genes, which the glomerular and cortex samples expressed at different levels. We found that genetic variants associated with kidney diseases (GWAS) and kidney expression quantitative trait loci were enriched in regulatory DNA regions. By combining GWAS, epigenomic, and chromatin conformation data, we functionally annotated 46 kidney disease genes. CONCLUSIONS: We demonstrate a powerful approach to functionally connect kidney disease-/trait-associated loci to their target genes by leveraging unique regulatory DNA maps and integrated epigenomic and genetic analysis. This process can be applied to other kidney cell types and will enhance our understanding of genome regulation and its effects on gene expression in kidney disease.

10.
Nature ; 489(7414): 83-90, 2012 Sep 06.
Article in English | MEDLINE | ID: mdl-22955618

ABSTRACT

Regulatory factor binding to genomic DNA protects the underlying sequence from cleavage by DNase I, leaving nucleotide-resolution footprints. Using genomic DNase I footprinting across 41 diverse cell and tissue types, we detected 45 million transcription factor occupancy events within regulatory regions, representing differential binding to 8.4 million distinct short sequence elements. Here we show that this small genomic sequence compartment, roughly twice the size of the exome, encodes an expansive repertoire of conserved recognition sequences for DNA-binding proteins that nearly doubles the size of the human cis-regulatory lexicon. We find that genetic variants affecting allelic chromatin states are concentrated in footprints, and that these elements are preferentially sheltered from DNA methylation. High-resolution DNase I cleavage patterns mirror nucleotide-level evolutionary conservation and track the crystallographic topography of protein-DNA interfaces, indicating that transcription factor structure has been evolutionarily imprinted on the human genome sequence. We identify a stereotyped 50-base-pair footprint that precisely defines the site of transcript origination within thousands of human promoters. Finally, we describe a large collection of novel regulatory factor recognition motifs that are highly conserved in both sequence and function, and exhibit cell-selective occupancy patterns that closely parallel major regulators of development, differentiation and pluripotency.


Subject(s)
DNA Footprinting , DNA/genetics , Encyclopedias as Topic , Genome, Human/genetics , Molecular Sequence Annotation , Regulatory Sequences, Nucleic Acid/genetics , Transcription Factors/metabolism , DNA Methylation , DNA-Binding Proteins/metabolism , Deoxyribonuclease I/metabolism , Genomic Imprinting , Genomics , Humans , Polymorphism, Single Nucleotide/genetics , Transcription Initiation Site
11.
Genome Res ; 22(9): 1680-8, 2012 Sep.
Article in English | MEDLINE | ID: mdl-22955980

ABSTRACT

CTCF is a ubiquitously expressed regulator of fundamental genomic processes including transcription, intra- and interchromosomal interactions, and chromatin structure. Because of its critical role in genome function, CTCF binding patterns have long been assumed to be largely invariant across different cellular environments. Here we analyze genome-wide occupancy patterns of CTCF by ChIP-seq in 19 diverse human cell types, including normal primary cells and immortal lines. We observed highly reproducible yet surprisingly plastic genomic binding landscapes, indicative of strong cell-selective regulation of CTCF occupancy. Comparison with massively parallel bisulfite sequencing data indicates that 41% of variable CTCF binding is linked to differential DNA methylation, concentrated at two critical positions within the CTCF recognition sequence. Unexpectedly, CTCF binding patterns were markedly different in normal versus immortal cells, with the latter showing widespread disruption of CTCF binding associated with increased methylation. Strikingly, this disruption is accompanied by up-regulation of CTCF expression, with the result that both normal and immortal cells maintain the same average number of CTCF occupancy sites genome-wide. These results reveal a tight linkage between DNA methylation and the global occupancy patterns of a major sequence-specific regulatory factor.


Subject(s)
DNA Methylation , Repressor Proteins/metabolism , Binding Sites/genetics , CCCTC-Binding Factor , Cell Line , Chromatin Immunoprecipitation , Cluster Analysis , CpG Islands , Gene Expression Regulation , High-Throughput Nucleotide Sequencing , Humans
12.
Genome Res ; 21(3): 402-9, 2011 Mar.
Article in English | MEDLINE | ID: mdl-21282478

ABSTRACT

In Mus spretus, the chloride channel 4 gene Clcn4-2 is X-linked and dosage compensated by X up-regulation and X inactivation, while in the closely related mouse species Mus musculus, Clcn4-2 has been translocated to chromosome 7. We sequenced Clcn4-2 in M. spretus and identified the breakpoints of the evolutionary translocation in the Mus lineage. Genetic and epigenetic differences were observed between the 5'ends of the autosomal and X-linked loci. Remarkably, Clcn4-2 introns have been truncated on chromosome 7 in M. musculus as compared with the X-linked loci from seven other eutherian mammals. Intron sequences specifically preserved in the X-linked loci were significantly enriched in AT-rich oligomers. Genome-wide analyses showed an overall enrichment in AT motifs unique to the eutherian X (except for genes that escape X inactivation), suggesting a role for these motifs in regulation of the X chromosome.


Subject(s)
Chloride Channels/genetics , 5' Flanking Region/genetics , AT Rich Sequence , Animals , Base Sequence , Chloride Channels/metabolism , Chromosome Breakage , Chromosome Mapping , Dosage Compensation, Genetic , Epigenomics , Evolution, Molecular , Female , Gene Dosage , Genes , Genome , Humans , Introns , Male , Mice , Molecular Sequence Data , Muridae , Protein Structure, Tertiary/genetics , Rats , Sequence Analysis, DNA , Species Specificity , X Chromosome/genetics
13.
Nature ; 453(7191): 56-64, 2008 May 01.
Article in English | MEDLINE | ID: mdl-18451855

ABSTRACT

Genetic variation among individual humans occurs on many different scales, ranging from gross alterations in the human karyotype to single nucleotide changes. Here we explore variation on an intermediate scale--particularly insertions, deletions and inversions affecting from a few thousand to a few million base pairs. We employed a clone-based method to interrogate this intermediate structural variation in eight individuals of diverse geographic ancestry. Our analysis provides a comprehensive overview of the normal pattern of structural variation present in these genomes, refining the location of 1,695 structural variants. We find that 50% were seen in more than one individual and that nearly half lay outside regions of the genome previously described as structurally variant. We discover 525 new insertion sequences that are not present in the human reference genome and show that many of these are variable in copy number between individuals. Complete sequencing of 261 structural variants reveals considerable locus complexity and provides insights into the different mutational processes that have shaped the human genome. These data provide the first high-resolution sequence map of human structural variation--a standard for genotyping platforms and a prelude to future individual genome sequencing projects.


Subject(s)
Genetic Variation/genetics , Genome, Human/genetics , Physical Chromosome Mapping , Sequence Analysis, DNA , Chromosome Inversion/genetics , Euchromatin/genetics , Gene Deletion , Geography , Haplotypes , Humans , Mutagenesis, Insertional/genetics , Polymorphism, Single Nucleotide/genetics , Racial Groups/genetics , Reproducibility of Results
14.
Nat Genet ; 37(7): 727-32, 2005 Jul.
Article in English | MEDLINE | ID: mdl-15895083

ABSTRACT

Inversions, deletions and insertions are important mediators of disease and disease susceptibility. We systematically compared the human genome reference sequence with a second genome (represented by fosmid paired-end sequences) to detect intermediate-sized structural variants >8 kb in length. We identified 297 sites of structural variation: 139 insertions, 102 deletions and 56 inversion breakpoints. Using combined literature, sequence and experimental analyses, we validated 112 of the structural variants, including several that are of biomedical relevance. These data provide a fine-scale structural variation map of the human genome and the requisite sequence precision for subsequent genetic studies of human disease.


Subject(s)
Genome, Human , Genomic Instability , Mutation , Polymorphism, Genetic , Base Pairing , Cell Line, Tumor , Computational Biology , Humans , Oligonucleotide Array Sequence Analysis , Reference Values , Sequence Analysis, DNA
15.
Antimicrob Agents Chemother ; 57(5): 2204-15, 2013 May.
Article in English | MEDLINE | ID: mdl-23459479

ABSTRACT

Pseudomonas aeruginosa can develop resistance to polymyxin as a consequence of mutations in the PhoPQ regulatory system, mediated by covalent lipid A modification. Transposon mutagenesis of a polymyxin-resistant phoQ mutant defined 41 novel loci required for resistance, including two regulatory systems, ColRS and CprRS. Deletion of the colRS genes, individually or in tandem, abrogated the polymyxin resistance of a ΔphoQ mutant, as did individual or tandem deletion of cprRS. Individual deletion of colR or colS in a ΔphoQ mutant also suppressed 4-amino-L-arabinose addition to lipid A, consistent with the known role of this modification in polymyxin resistance. Surprisingly, tandem deletion of colRS or cprRS in the ΔphoQ mutant or individual deletion of cprR or cprS failed to suppress 4-amino-L-arabinose addition to lipid A, indicating that this modification alone is not sufficient for PhoPQ-mediated polymyxin resistance in P. aeruginosa. Episomal expression of colRS or cprRS in tandem or of cprR individually complemented the Pm resistance phenotype in the ΔphoQ mutant, while episomal expression of colR, colS, or cprS individually did not. Highly polymyxin-resistant phoQ mutants of P. aeruginosa isolated from polymyxin-treated cystic fibrosis patients harbored mutant alleles of colRS and cprS; when expressed in a ΔphoQ background, these mutant alleles enhanced polymyxin resistance. These results define ColRS and CprRS as two-component systems regulating polymyxin resistance in P. aeruginosa, indicate that addition of 4-amino-L-arabinose to lipid A is not the only PhoPQ-regulated biochemical mechanism required for resistance, and demonstrate that colRS and cprS mutations can contribute to high-level clinical resistance.


Subject(s)
Anti-Bacterial Agents/pharmacology , Bacterial Proteins/genetics , Drug Resistance, Bacterial/drug effects , Gene Expression Regulation, Bacterial/drug effects , Genes, Regulator/drug effects , Polymyxins/pharmacology , Pseudomonas aeruginosa/drug effects , Arabinose/analogs & derivatives , Arabinose/metabolism , Bacterial Proteins/metabolism , Cystic Fibrosis/drug therapy , Cystic Fibrosis/microbiology , DNA Transposable Elements , Drug Resistance, Bacterial/genetics , Gene Deletion , Genetic Complementation Test , Genetic Loci , Humans , Lipid A/metabolism , Mutation , Plasmids , Pseudomonas Infections/drug therapy , Pseudomonas Infections/microbiology , Pseudomonas aeruginosa/genetics , Pseudomonas aeruginosa/isolation & purification , Pseudomonas aeruginosa/metabolism
16.
Nat Methods ; 7(5): 365-71, 2010 May.
Article in English | MEDLINE | ID: mdl-20440878

ABSTRACT

The extent of human genomic structural variation suggests that there must be portions of the genome yet to be discovered, annotated and characterized at the sequence level. We present a resource and analysis of 2,363 new insertion sequences corresponding to 720 genomic loci. We found that a substantial fraction of these sequences are either missing, fragmented or misassigned when compared to recent de novo sequence assemblies from short-read next-generation sequence data. We determined that 18-37% of these new insertions are copy-number polymorphic, including loci that show extensive population stratification among Europeans, Asians and Africans. Complete sequencing of 156 of these insertions identified new exons and conserved noncoding sequences not yet represented in the reference genome. We developed a method to accurately genotype these new insertions by mapping next-generation sequencing datasets to the breakpoint, thereby providing a means to characterize copy-number status for regions previously inaccessible to single-nucleotide polymorphism microarrays.


Subject(s)
Contig Mapping/methods , Genome, Human , Sequence Analysis, DNA/methods , DNA Transposable Elements/genetics , Gene Frequency , Genomic Structural Variation/genetics , Genotype , Humans , In Situ Hybridization, Fluorescence , Molecular Sequence Data , Polymorphism, Genetic
18.
J Bacteriol ; 194(24): 6965-6, 2012 Dec.
Article in English | MEDLINE | ID: mdl-23209222

ABSTRACT

Here we report the complete, accurate 1.89-Mb genome sequence of Francisella tularensis subsp. holarctica strain FSC200, isolated in 1998 in the Swedish municipality Ljusdal, which is in an area where tularemia is highly endemic. This genome is important because strain FSC200 has been extensively used for functional and genetic studies of Francisella and is well-characterized.


Subject(s)
Francisella tularensis/genetics , Genome, Bacterial , Tularemia/microbiology , Bacterial Typing Techniques , Base Sequence , Child, Preschool , DNA, Bacterial/genetics , Female , Francisella tularensis/isolation & purification , Humans , Molecular Sequence Data , Sequence Analysis, DNA , Sweden
19.
Nature ; 440(7088): 1194-8, 2006 Apr 27.
Article in English | MEDLINE | ID: mdl-16641997

ABSTRACT

After the completion of a draft human genome sequence, the International Human Genome Sequencing Consortium has proceeded to finish and annotate each of the 24 chromosomes comprising the human genome. Here we describe the sequencing and analysis of human chromosome 3, one of the largest human chromosomes. Chromosome 3 comprises just four contigs, one of which currently represents the longest unbroken stretch of finished DNA sequence known so far. The chromosome is remarkable in having the lowest rate of segmental duplication in the genome. It also includes a chemokine receptor gene cluster as well as numerous loci involved in multiple human cancers such as the gene encoding FHIT, which contains the most common constitutive fragile site in the genome, FRA3B. Using genomic sequence from chimpanzee and rhesus macaque, we were able to characterize the breakpoints defining a large pericentric inversion that occurred some time after the split of Homininae from Ponginae, and propose an evolutionary history of the inversion.


Subject(s)
Chromosomes, Human, Pair 3/genetics , Animals , Base Sequence , Chromosome Breakage/genetics , Chromosome Inversion/genetics , Contig Mapping , CpG Islands/genetics , DNA, Complementary/genetics , Evolution, Molecular , Expressed Sequence Tags , Human Genome Project , Humans , Macaca mulatta/genetics , Molecular Sequence Data , Pan troglodytes/genetics , Sequence Analysis, DNA , Synteny/genetics
20.
J Bacteriol ; 193(14): 3668-9, 2011 Jul.
Article in English | MEDLINE | ID: mdl-21571998

ABSTRACT

The genome sequence of the aceticlastic methanoarchaeon Methanosaeta concilii GP6, comprised of a 3,008,626-bp chromosome and an 18,019-bp episome, has been determined and exhibits considerable differences in gene content from that of Methanosaeta thermophila.


Subject(s)
Genome, Archaeal , Methane/metabolism , Methanosarcinaceae/genetics , Methanosarcinaceae/metabolism , Base Sequence , DNA, Archaeal/genetics , Methanosarcinaceae/classification , Molecular Sequence Data , Sequence Analysis, DNA
SELECTION OF CITATIONS
SEARCH DETAIL