Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 80
Filter
1.
Diabetologia ; 63(6): 1223-1235, 2020 06.
Article in English | MEDLINE | ID: mdl-32173762

ABSTRACT

AIMS/HYPOTHESIS: Drug and surgical-based therapies in type 2 diabetes are associated with altered gut microbiota architecture. Here we investigated the role of the gut microbiome in improved glucose homeostasis following bariatric surgery. METHODS: We carried out gut microbiome analyses in gastrectomised (by vertical sleeve gastrectomy [VSG]) rats of the Goto-Kakizaki (GK) non-obese model of spontaneously occurring type 2 diabetes, followed by physiological studies in the GK rat. RESULTS: VSG in the GK rat led to permanent improvement of glucose tolerance associated with minor changes in the gut microbiome, mostly characterised by significant enrichment of caecal Prevotella copri. Gut microbiota enrichment with P. copri in GK rats through permissive antibiotic treatment, inoculation of gut microbiota isolated from gastrectomised GK rats, and direct inoculation of P. copri, resulted in significant improvement of glucose tolerance, independent of changes in body weight. Plasma bile acids were increased in GK rats following inoculation with P. copri and P. copri-enriched microbiota from VSG-treated rats; the inoculated GK rats then showed increased liver glycogen and upregulated expression of Fxr (also known as Nr1h4), Srebf1c, Chrebp (also known as Mlxipl) and Il10 and downregulated expression of Cyp7a1. CONCLUSIONS: Our data underline the impact of intestinal P. copri on improved glucose homeostasis through enhanced bile acid metabolism and farnesoid X receptor (FXR) signalling, which may represent a promising opportunity for novel type 2 diabetes therapeutics.


Subject(s)
Diabetes Mellitus, Type 2/microbiology , Gastrointestinal Microbiome/physiology , Prevotella/physiology , Animals , Blood Glucose/metabolism , Body Weight/physiology , Male , Rats , Signal Transduction/physiology
2.
BMC Genomics ; 21(1): 259, 2020 Mar 30.
Article in English | MEDLINE | ID: mdl-32228451

ABSTRACT

BACKGROUND: The olive fruit fly, Bactrocera oleae, is the most important pest in the olive fruit agribusiness industry. This is because female flies lay their eggs in the unripe fruits and upon hatching the larvae feed on the fruits thus destroying them. The lack of a high-quality genome and other genomic and transcriptomic data has hindered progress in understanding the fly's biology and proposing alternative control methods to pesticide use. RESULTS: Genomic DNA was sequenced from male and female Demokritos strain flies, maintained in the laboratory for over 45 years. We used short-, mate-pair-, and long-read sequencing technologies to generate a combined male-female genome assembly (GenBank accession GCA_001188975.2). Genomic DNA sequencing from male insects using 10x Genomics linked-reads technology followed by mate-pair and long-read scaffolding and gap-closing generated a highly contiguous 489 Mb genome with a scaffold N50 of 4.69 Mb and L50 of 30 scaffolds (GenBank accession GCA_001188975.4). RNA-seq data generated from 12 tissues and/or developmental stages allowed for genome annotation. Short reads from both males and females and the chromosome quotient method enabled identification of Y-chromosome scaffolds which were extensively validated by PCR. CONCLUSIONS: The high-quality genome generated represents a critical tool in olive fruit fly research. We provide an extensive RNA-seq data set, and genome annotation, critical towards gaining an insight into the biology of the olive fruit fly. In addition, elucidation of Y-chromosome sequences will advance our understanding of the Y-chromosome's organization, function and evolution and is poised to provide avenues for sterile insect technique approaches.


Subject(s)
Tephritidae/genetics , Y Chromosome/genetics , Y Chromosome/metabolism , Animals , Female , Genome, Insect/genetics , Male , Polymerase Chain Reaction
3.
Cell Microbiol ; 18(10): 1319-38, 2016 10.
Article in English | MEDLINE | ID: mdl-26936325

ABSTRACT

Clustered regularly interspaced short palindromic repeats with CRISPR-associated gene (CRISPR-Cas) systems are widely recognized as critical genome defense systems that protect microbes from external threats such as bacteriophage infection. Several isolates of the intracellular pathogen Legionella pneumophila possess multiple CRISPR-Cas systems (type I-C, type I-F and type II-B), yet the targets of these systems remain unknown. With the recent observation that at least one of these systems (II-B) plays a non-canonical role in supporting intracellular replication, the possibility remained that these systems are vestigial genome defense systems co-opted for other purposes. Our data indicate that this is not the case. Using an established plasmid transformation assay, we demonstrate that type I-C, I-F and II-B CRISPR-Cas provide protection against spacer targets. We observe efficient laboratory acquisition of new spacers under 'priming' conditions, in which initially incomplete target elimination leads to the generation of new spacers and ultimate loss of the invasive DNA. Critically, we identify the first known target of L. pneumophila CRISPR-Cas: a 30 kb episome of unknown function whose interbacterial transfer is guarded against by CRISPR-Cas. We provide evidence that the element can subvert CRISPR-Cas by mutating its targeted sequences - but that primed spacer acquisition may limit this mechanism of escape. Rather than generally impinging on bacterial fitness, this element drives a host specialization event - with improved fitness in Acanthamoeba but a reduced ability to replicate in other hosts and conditions. These observations add to a growing body of evidence that host range restriction can serve as an existential threat to L. pneumophila in the wild.


Subject(s)
Clustered Regularly Interspaced Short Palindromic Repeats , Legionella pneumophila/genetics , Acanthamoeba castellanii/microbiology , Base Sequence , Conserved Sequence , Evolution, Molecular , Genes, Bacterial , Host-Pathogen Interactions , Legionella pneumophila/growth & development , Microbial Viability , Sequence Analysis, DNA
4.
Dev Biol ; 404(2): 149-63, 2015 Aug 15.
Article in English | MEDLINE | ID: mdl-26025923

ABSTRACT

Genome-wide resources, such as collections of cDNA clones encoding for complete proteins (full-ORF clones), are crucial tools for studying the evolution of gene function and genetic interactions. Non-model organisms, in particular marine organisms, provide a rich source of functional diversity. Marine organism genomes are, however, frequently highly polymorphic and encode proteins that diverge significantly from those of well-annotated model genomes. The construction of full-ORF clone collections from non-model organisms is hindered by the difficulty of predicting accurately the N-terminal ends of proteins, and distinguishing recent paralogs from highly polymorphic alleles. We report a computational strategy that overcomes these difficulties, and allows for accurate gene level clustering of transcript data followed by the automated identification of full-ORFs with correct 5'- and 3'-ends. It is robust to polymorphism, includes paralog calling and does not require evolutionary proximity to well annotated model organisms. We developed this pipeline for the ascidian Ciona intestinalis, a highly polymorphic member of the divergent sister group of the vertebrates, emerging as a powerful model organism to study chordate gene function, Gene Regulatory Networks and molecular mechanisms underlying human pathologies. Using this pipeline we have generated the first full-ORF collection for a highly polymorphic marine invertebrate. It contains 19,163 full-ORF cDNA clones covering 60% of Ciona coding genes, and full-ORF orthologs for approximately half of curated human disease-associated genes.


Subject(s)
Ciona intestinalis/genetics , Gene Regulatory Networks/genetics , Genetic Predisposition to Disease , Algorithms , Animals , Base Sequence , Biological Evolution , Evolution, Molecular , Gene Expression Profiling , Humans , Multigene Family/genetics , Open Reading Frames/genetics , Sequence Alignment , Sequence Analysis, DNA
5.
Antimicrob Agents Chemother ; 60(11): 6719-6725, 2016 11.
Article in English | MEDLINE | ID: mdl-27572407

ABSTRACT

Sequencing of the blaIMP-4-carrying C. freundii B38 using the PacBio SMRT technique revealed that the genome contained a chromosome of 5,134,500 bp and three plasmids, pOZ172 (127,005 bp), pOZ181 (277,592 bp), and pOZ182 (18,467 bp). Plasmid pOZ172 was identified as IncFIIY, like pP10164-NDM and pNDM-EcGN174. It carries a class 1 integron with four cassettes (blaIMP-4-qacG2-aacA4-aphA15) and a complete hybrid tni module (tniR-tniQ-tniB-tniA). The recombination of tniR from Tn402 (identical) with tniQBA from Tn5053 (99%) occurred within the res site of Tn402/5053 The Tn402/5053-like integron, named Tn6017, was inserted into Tn1722 at the res II site. The replication, partitioning, and transfer systems of pOZ181 were similar to those of IncHI2 plasmids (e.g., R478) and contained a sul1-type class 1 integron with the cassette array orf-dfrA1-orf-gcu37-aadA5 linked to an upstream Tn1696 tnpA-tnpR and to a downstream 3' conserved sequence (3'-CS) and ISCR1 A Tn2 transposon encoding a blaTEM-1 ß-lactamase was identified on pOZ182. Other interesting resistance determinants encoded on the B38 chromosome included multidrug resistance (MDR) efflux pumps, an AmpC ß-lactamase, and resistances to Cu, Ag, As, and Zn. This is the first report of a complete tni module linked to a blaIMP-4-carrying class 1 integron, which, together with other recently reported non-sul1 integrons, represents the emergence of a distinct evolutionary lineage of class 1 integrons lacking a 3'-CS (qacEΔ1-sul1). The unique cassette array, complete tni module of Tn6017, and incompatibility group of pOZ172 suggest a blaIMP-4 evolutionary pathway in C. freundii B38 different from that for other blaIMP-4 genes found in Gram-negative bacteria in the Western Pacific region.


Subject(s)
Bacterial Proteins/genetics , Citrobacter freundii/genetics , Gene Expression Regulation, Bacterial , Genome, Bacterial , Plasmids/metabolism , beta-Lactamases/genetics , Anti-Bacterial Agents/pharmacology , Bacterial Proteins/metabolism , Biological Evolution , Chromosomes, Bacterial/chemistry , Citrobacter freundii/drug effects , Citrobacter freundii/metabolism , DNA Transposable Elements , Drug Resistance, Multiple, Bacterial/genetics , Integrons , Microbial Sensitivity Tests , Plasmids/chemistry , Sequence Analysis, DNA , beta-Lactamases/metabolism
6.
J Clin Microbiol ; 54(1): 83-92, 2016 Jan.
Article in English | MEDLINE | ID: mdl-26491184

ABSTRACT

An outbreak of type emm59 invasive group A Streptococcus (iGAS) disease was declared in 2008 in Thunder Bay District, Northwestern Ontario, 2 years after a countrywide emm59 epidemic was recognized in Canada. Despite a declining number of emm59 infections since 2010, numerous cases of iGAS disease continue to be reported in the area. We collected clinical information on all iGAS cases recorded in Thunder Bay District from 2008 to 2013. We also emm typed and sequenced the genomes of all available strains isolated from 2011 to 2013 from iGAS infections and from severe cases of soft tissue infections. We used whole-genome sequencing data to investigate the population structure of GAS strains of the most frequently isolated emm types. We report an increased incidence of iGAS in Thunder Bay compared to the metropolitan area of Toronto/Peel and the province of Ontario. Illicit drug use, alcohol abuse, homelessness, and hepatitis C infection were underlying diseases or conditions that might have predisposed patients to iGAS disease. Most cases were caused by clonal strains of skin or generalist emm types (i.e., emm82, emm87, emm101, emm4, emm83, and emm114) uncommonly seen in other areas of the province. We observed rapid waxing and waning of emm types causing disease and their replacement by other emm types associated with the same tissue tropisms. Thus, iGAS disease in Thunder Bay District predominantly affects a select population of disadvantaged persons and is caused by clonally related strains of a few skin and generalist emm types less commonly associated with iGAS in other areas of Ontario.


Subject(s)
Antigens, Bacterial/genetics , Bacterial Outer Membrane Proteins/genetics , Carrier Proteins/genetics , Disease Outbreaks , Genotype , Streptococcal Infections/epidemiology , Streptococcal Infections/microbiology , Streptococcus pyogenes/classification , Streptococcus pyogenes/isolation & purification , Adolescent , Adult , Aged , Aged, 80 and over , Child , Child, Preschool , Female , Genetic Variation , Genotyping Techniques , Humans , Incidence , Infant , Infant, Newborn , Male , Middle Aged , Ontario/epidemiology , Risk Factors , Sequence Analysis, DNA , Streptococcus pyogenes/genetics , Young Adult
7.
Nature ; 463(7282): 775-80, 2010 Feb 11.
Article in English | MEDLINE | ID: mdl-20148032

ABSTRACT

Insulin from the beta-cells of the pancreatic islets of Langerhans controls energy homeostasis in vertebrates, and its deficiency causes diabetes mellitus. During embryonic development, the transcription factor neurogenin 3 (Neurog3) initiates the differentiation of the beta-cells and other islet cell types from pancreatic endoderm, but the genetic program that subsequently completes this differentiation remains incompletely understood. Here we show that the transcription factor Rfx6 directs islet cell differentiation downstream of Neurog3. Mice lacking Rfx6 failed to generate any of the normal islet cell types except for pancreatic-polypeptide-producing cells. In human infants with a similar autosomal recessive syndrome of neonatal diabetes, genetic mapping and subsequent sequencing identified mutations in the human RFX6 gene. These studies demonstrate a unique position for Rfx6 in the hierarchy of factors that coordinate pancreatic islet development in both mice and humans. Rfx6 could prove useful in efforts to generate beta-cells for patients with diabetes.


Subject(s)
Cell Differentiation , DNA-Binding Proteins/metabolism , Insulin/biosynthesis , Islets of Langerhans/cytology , Islets of Langerhans/metabolism , Transcription Factors/metabolism , Animals , Basic Helix-Loop-Helix Transcription Factors/deficiency , Basic Helix-Loop-Helix Transcription Factors/genetics , Basic Helix-Loop-Helix Transcription Factors/metabolism , DNA Mutational Analysis , DNA-Binding Proteins/deficiency , DNA-Binding Proteins/genetics , Diabetes Mellitus/congenital , Diabetes Mellitus/genetics , Diabetes Mellitus/metabolism , Diabetes Mellitus/pathology , Embryo, Mammalian/metabolism , Female , Fetus/metabolism , Gene Expression Profiling , Gene Expression Regulation, Developmental , Genes, Recessive/genetics , Genetic Testing , Humans , Infant, Newborn , Islets of Langerhans/embryology , Male , Mice , NIH 3T3 Cells , Nerve Tissue Proteins/deficiency , Nerve Tissue Proteins/genetics , Nerve Tissue Proteins/metabolism , Organ Specificity , Regulatory Factor X Transcription Factors , Syndrome , Transcription Factors/deficiency , Transcription Factors/genetics
8.
BMC Biol ; 13: 41, 2015 Jun 20.
Article in English | MEDLINE | ID: mdl-26092298

ABSTRACT

BACKGROUND: We report here the first genome-wide high-resolution polymorphism resource for non-human primate (NHP) association and linkage studies, constructed for the Caribbean-origin vervet monkey, or African green monkey (Chlorocebus aethiops sabaeus), one of the most widely used NHPs in biomedical research. We generated this resource by whole genome sequencing (WGS) of monkeys from the Vervet Research Colony (VRC), an NIH-supported research resource for which extensive phenotypic data are available. RESULTS: We identified genome-wide single nucleotide polymorphisms (SNPs) by WGS of 721 members of an extended pedigree from the VRC. From high-depth WGS data we identified more than 4 million polymorphic unequivocal segregating sites; by pruning these SNPs based on heterozygosity, quality control filters, and the degree of linkage disequilibrium (LD) between SNPs, we constructed genome-wide panels suitable for genetic association (about 500,000 SNPs) and linkage analysis (about 150,000 SNPs). To further enhance the utility of these resources for linkage analysis, we used a further pruned subset of the linkage panel to generate multipoint identity by descent matrices. CONCLUSIONS: The genetic and phenotypic resources now available for the VRC and other Caribbean-origin vervets enable their use for genetic investigation of traits relevant to human diseases.


Subject(s)
Chlorocebus aethiops/genetics , Linkage Disequilibrium , Polymorphism, Single Nucleotide , Animals , Chromosome Mapping , Female , Genome-Wide Association Study , Genotype , Humans , Male , Microsatellite Repeats , Phenotype , Quantitative Trait Loci , Sequence Analysis
9.
Emerg Infect Dis ; 21(4): 585-91, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25811284

ABSTRACT

We recently showed that 37/600 (6.2%) invasive infections with group B Streptococcus (GBS) in Toronto, Ontario, Canada, were caused by serotype IV strains. We report a relatively high level of genetic diversity in 37 invasive strains of this emerging GBS serotype. Multilocus sequence typing identified 6 sequence types (STs) that belonged to 3 clonal complexes. Most isolates were ST-459 (19/37, 51%) and ST-452 (11/37, 30%), but we also identified ST-291, ST-3, ST-196, and a novel ST-682. We detected further diversity by performing whole-genome single-nucleotide polymorphism analysis and found evidence of recombination events contributing to variation in some serotype IV GBS strains. We also evaluated antimicrobial drug resistance and found that ST-459 strains were resistant to clindamycin and erythromycin, whereas strains of other STs were, for the most part, susceptible to these antimicrobial drugs.


Subject(s)
Anti-Infective Agents/pharmacology , Drug Resistance, Bacterial , Population Surveillance , Streptococcal Infections/epidemiology , Streptococcal Infections/microbiology , Streptococcus agalactiae/classification , Streptococcus agalactiae/drug effects , Adolescent , Adult , Age Distribution , Aged , Child , Child, Preschool , Computational Biology , Genetic Variation , Humans , Infant , Infant, Newborn , Microbial Sensitivity Tests , Middle Aged , Multilocus Sequence Typing , Ontario/epidemiology , Phylogeny , Serogroup , Streptococcus agalactiae/genetics , Streptococcus agalactiae/isolation & purification , Young Adult
11.
Hum Mol Genet ; 21(15): 3307-16, 2012 Aug 01.
Article in English | MEDLINE | ID: mdl-22556363

ABSTRACT

Non-human primates provide genetic model systems biologically intermediate between humans and other mammalian model organisms. Populations of Caribbean vervet monkeys (Chlorocebus aethiops sabaeus) are genetically homogeneous and large enough to permit well-powered genetic mapping studies of quantitative traits relevant to human health, including expression quantitative trait loci (eQTL). Previous transcriptome-wide investigation in an extended vervet pedigree identified 29 heritable transcripts for which levels of expression in peripheral blood correlate strongly with expression levels in the brain. Quantitative trait linkage analysis using 261 microsatellite markers identified significant (n = 8) and suggestive (n = 4) linkages for 12 of these transcripts, including both cis- and trans-eQTL. Seven transcripts, located on different chromosomes, showed maximum linkage to markers in a single region of vervet chromosome 9; this observation suggests the possibility of a master trans-regulator locus in this region. For one cis-eQTL (at B3GALTL, beta-1,3-glucosyltransferase), we conducted follow-up single nucleotide polymorphism genotyping and fine-scale association analysis in a sample of unrelated Caribbean vervets, localizing this eQTL to a region of <200 kb. These results suggest the value of pedigree and population samples of the Caribbean vervet for linkage and association mapping studies of quantitative traits. The imminent whole genome sequencing of many of these vervet samples will enhance the power of such investigations by providing a comprehensive catalog of genetic variation.


Subject(s)
Chlorocebus aethiops/genetics , Primates/genetics , Quantitative Trait Loci , Animals , Caribbean Region , Genetic Linkage , Genome , Glucuronosyltransferase/genetics , Microsatellite Repeats/genetics , Pedigree , Polymorphism, Single Nucleotide
12.
Nat Genet ; 2024 Jun 27.
Article in English | MEDLINE | ID: mdl-38937606

ABSTRACT

The factors driving or preventing pathological expansion of tandem repeats remain largely unknown. Here, we assessed the FGF14 (GAA)·(TTC) repeat locus in 2,530 individuals by long-read and Sanger sequencing and identified a common 5'-flanking variant in 70.34% of alleles analyzed (3,463/4,923) that represents the phylogenetically ancestral allele and is present on all major haplotypes. This common sequence variation is present nearly exclusively on nonpathogenic alleles with fewer than 30 GAA-pure triplets and is associated with enhanced stability of the repeat locus upon intergenerational transmission and increased Fiber-seq chromatin accessibility.

13.
Genome Res ; 20(5): 636-45, 2010 May.
Article in English | MEDLINE | ID: mdl-20212022

ABSTRACT

Pre-mRNA 5' spliced-leader (SL) trans-splicing occurs in some metazoan groups but not in others. Genome-wide characterization of the trans-spliced mRNA subpopulation has not yet been reported for any metazoan. We carried out a high-throughput analysis of the SL trans-spliced mRNA population of the ascidian tunicate Ciona intestinalis by 454 Life Sciences (Roche) pyrosequencing of SL-PCR-amplified random-primed reverse transcripts of tailbud embryo RNA. We obtained approximately 250,000 high-quality reads corresponding to 8790 genes, approximately 58% of the Ciona total gene number. The great depth of this data revealed new aspects of trans-splicing, including the existence of a significant class of "infrequently trans-spliced" genes, accounting for approximately 28% of represented genes, that generate largely non-trans-spliced mRNAs, but also produce trans-spliced mRNAs, in part through alternative promoter use. Thus, the conventional qualitative dichotomy of trans-spliced versus non-trans-spliced genes should be supplanted by a more accurate quantitative view recognizing frequently and infrequently trans-spliced gene categories. Our data include reads representing approximately 80% of Ciona frequently trans-spliced genes. Our analysis also revealed significant use of closely spaced alternative trans-splice acceptor sites which further underscores the mechanistic similarity of cis- and trans-splicing and indicates that the prevalence of +/-3-nt alternative splicing events at tandem acceptor sites, NAGNAG, is driven by spliceosomal mechanisms, and not nonsense-mediated decay, or selection at the protein level. The breadth of gene representation data enabled us to find new correlations between trans-splicing status and gene function, namely the overrepresentation in the frequently trans-spliced gene class of genes associated with plasma/endomembrane system, Ca(2+) homeostasis, and actin cytoskeleton.


Subject(s)
Alternative Splicing , Ciona intestinalis , Protozoan Proteins/metabolism , RNA, Messenger , RNA, Spliced Leader , Trans-Splicing , Animals , Chromosome Mapping , Ciona intestinalis/embryology , Ciona intestinalis/genetics , Ciona intestinalis/metabolism , DNA, Complementary/genetics , DNA, Complementary/metabolism , Gene Expression Profiling , Models, Genetic , Polymerase Chain Reaction , Protozoan Proteins/chemistry , Protozoan Proteins/genetics , RNA Precursors/genetics , RNA Precursors/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA, Protozoan/genetics , RNA, Protozoan/metabolism , RNA, Spliced Leader/genetics , RNA, Spliced Leader/metabolism , Sequence Analysis, DNA/methods
14.
Nat Genet ; 36(8): 872-6, 2004 Aug.
Article in English | MEDLINE | ID: mdl-15258580

ABSTRACT

We identified the gene carrying the juvenile spermatogonial depletion mutation (jsd), a recessive spermatogenic defect mapped to mouse chromosome 1 (refs. 1,2). We localized jsd to a 272-kb region and resequenced this area to identify the underlying mutation: a frameshift that severely truncates the predicted protein product of a 2.3-kb genomic open reading frame. This gene, Utp14b, evidently arose through reverse transcription of an mRNA from an X-linked gene and integration of the resulting cDNA into an intron of an autosomal gene, whose promoter and 5' untranslated exons are shared with Utp14b. To our knowledge, Utp14b is the first protein-coding retrogene to be linked to a recessive mammalian phenotype. The X-linked progenitor of Utp14b is the mammalian ortholog of yeast Utp14, which encodes a protein required for processing of pre-rRNA and hence for ribosome assembly. Our findings substantiate the hypothesis that mammalian spermatogenesis is supported by autosomal retrogenes that evolved from X-linked housekeeping genes to compensate for silencing of the X chromosome during male meiosis. We find that Utp14b-like retrogenes arose independently and were conserved during evolution in at least four mammalian lineages. This recurrence implies a strong selective pressure, perhaps to enable ribosome assembly in male meiotic cells.


Subject(s)
Spermatogenesis/genetics , Animals , Base Sequence , Chromosome Mapping , Conserved Sequence , Frameshift Mutation , Male , Mice , Mice, Inbred C57BL , Molecular Sequence Data , Open Reading Frames , Phylogeny , Sequence Alignment , X Chromosome
15.
Life Sci Alliance ; 6(1)2023 01.
Article in English | MEDLINE | ID: mdl-36261227

ABSTRACT

The structural arrangements of bacterial chromosomes vary widely between closely related species and can result in significant phenotypic outcomes. The appearance of large-scale chromosomal inversions that are symmetric relative to markers for the origin of replication (OriC) has been previously observed; however, the overall prevalence of replication-associated structural rearrangements (RASRs) in bacteria and their causal mechanisms are currently unknown. Here, we systematically identify the locations of RASRs in species with multiple complete-sequenced genomes and investigate potential mediating biological mechanisms. We found that 247 of 313 species contained sequences with at least one large (>50 Kb) inversion in their sequence comparisons, and the aggregated inversion distances away from symmetry were normally distributed with a mean of zero. Many inversions that were offset from dnaA were found to be centered on a different marker for the OriC Instances of flanking repeats provide evidence that breaks formed during the replication process could be repaired to opposing positions. We also found a strong relationship between the later stages of replication and the range in distance variation from symmetry.


Subject(s)
Chromosomes, Bacterial , Replication Origin , Humans , Replication Origin/genetics , Chromosomes, Bacterial/genetics , Chromosome Inversion/genetics , Sequence Analysis, DNA , Bacteria/genetics
16.
Am J Hum Genet ; 85(3): 377-93, 2009 Sep.
Article in English | MEDLINE | ID: mdl-19732864

ABSTRACT

Common SNPs in the chromosome 17q12-q21 region alter the risk for asthma, type 1 diabetes, primary biliary cirrhosis, and Crohn disease. Previous reports by us and others have linked the disease-associated genetic variants with changes in expression of GSDMB and ORMDL3 transcripts in human lymphoblastoid cell lines (LCLs). The variants also alter regulation of other transcripts, and this domain-wide cis-regulatory effect suggests a mechanism involving long-range chromatin interactions. Here, we further dissect the disease-linked haplotype and identify putative causal DNA variants via a combination of genetic and functional analyses. First, high-throughput resequencing of the region and genotyping of potential candidate variants were performed. Next, additional mapping of allelic expression differences in Yoruba HapMap LCLs allowed us to fine-map the basis of the cis-regulatory differences to a handful of candidate functional variants. Functional assays identified allele-specific differences in nucleosome distribution, an allele-specific association with the insulator protein CTCF, as well as a weak promoter activity for rs12936231. Overall, this study shows a common disease allele linked to changes in CTCF binding and nucleosome occupancy leading to altered domain-wide cis-regulation. Finally, a strong association between asthma and cis-regulatory haplotypes was observed in three independent family-based cohorts (p = 1.78 x 10(-8)). This study demonstrates the requirement of multiple parallel allele-specific tools for the investigation of noncoding disease variants and functional fine-mapping of human disease-associated haplotypes.


Subject(s)
Alleles , Asthma/genetics , Autoimmune Diseases/genetics , Chromatin Assembly and Disassembly/genetics , Egg Proteins/genetics , Membrane Proteins/genetics , Neoplasm Proteins/genetics , Adolescent , Asthma/complications , Autoimmune Diseases/complications , Base Sequence , Cell Line , Child , Chromosomes, Human, Pair 17/genetics , DNA Mutational Analysis , Egg Proteins/metabolism , Female , Genes, Reporter , Genetic Predisposition to Disease , Haplotypes , Humans , Male , Membrane Proteins/metabolism , Molecular Sequence Data , Neoplasm Proteins/metabolism , Pedigree , Polymorphism, Single Nucleotide/genetics , Regulatory Sequences, Nucleic Acid/genetics , White People/genetics
17.
Genome Res ; 19(9): 1542-52, 2009 Sep.
Article in English | MEDLINE | ID: mdl-19605794

ABSTRACT

New high-throughput sequencing technologies are generating large amounts of sequence data, allowing the development of targeted large-scale resequencing studies. For these studies, accurate identification of polymorphic sites is crucial. Heterozygous sites are particularly difficult to identify, especially in regions of low coverage. We present a new strategy for identifying heterozygous sites in a single individual by using a machine learning approach that generates a heterozygosity score for each chromosomal position. Our approach also facilitates the identification of regions with unequal representation of two alleles and other poorly sequenced regions. The availability of confidence scores allows for a principled combination of sequencing results from multiple samples. We evaluate our method on a gold standard data genotype set from HapMap. We are able to classify sites in this data set as heterozygous or homozygous with 98.5% accuracy. In de novo data our probabilistic heterozygote detection ("ProbHD") is able to identify 93% of heterozygous sites at a <5% false call rate (FCR) as estimated based on independent genotyping results. In direct comparison of ProbHD with high-coverage 1000 Genomes sequencing available for a subset of our data, we observe >99.9% overall agreement for genotype calls and close to 90% agreement for heterozygote calls. Overall, our data indicate that high-throughput resequencing of human genomic regions requires careful attention to systematic biases in sample preparation as well as sequence contexts, and that their impact can be alleviated by machine learning-based sequence analyses allowing more accurate extraction of true DNA variants.


Subject(s)
Genome, Human/genetics , Polymorphism, Single Nucleotide/genetics , Probability , Sequence Analysis, DNA/methods , Computational Biology/methods , Genotype , Heterozygote , Humans , Models, Statistical
18.
BMC Med Genet ; 13: 72, 2012 Aug 15.
Article in English | MEDLINE | ID: mdl-22894767

ABSTRACT

BACKGROUND: Zellweger syndrome (ZS) is a peroxisome biogenesis disorder due to mutations in any one of 13 PEX genes. Increased incidence of ZS has been suspected in French-Canadians of the Saguenay-Lac-St-Jean region (SLSJ) of Quebec, but this remains unsolved. METHODS: We identified 5 ZS patients from SLSJ diagnosed by peroxisome dysfunction between 1990-2010 and sequenced all coding exons of known PEX genes in one patient using Next Generation Sequencing (NGS) for diagnostic confirmation. RESULTS: A homozygous mutation (c.802_815del, p.[Val207_Gln294del, Val76_Gln294del]) in PEX6 was identified and then shown in 4 other patients. Parental heterozygosity was confirmed in all. Incidence of ZS was estimated to 1 in 12,191 live births, with a carrier frequency of 1 in 55. In addition, we present data suggesting that this mutation abolishes a SF2/ASF splice enhancer binding site, resulting in the use of two alternative cryptic donor splice sites and predicted to encode an internally deleted in-frame protein. CONCLUSION: We report increased incidence of ZS in French-Canadians of SLSJ caused by a PEX6 founder mutation. To our knowledge, this is the highest reported incidence of ZS worldwide. These findings have implications for carrier screening and support the utility of NGS for molecular confirmation of peroxisomal disorders.


Subject(s)
Adenosine Triphosphatases/genetics , Founder Effect , Mutation , White People/genetics , Zellweger Syndrome/epidemiology , Zellweger Syndrome/genetics , ATPases Associated with Diverse Cellular Activities , Base Sequence , Female , France/ethnology , High-Throughput Nucleotide Sequencing , Humans , Male , Quebec/epidemiology , Zellweger Syndrome/enzymology
19.
Acta Neuropathol ; 124(5): 693-704, 2012 Nov.
Article in English | MEDLINE | ID: mdl-22802095

ABSTRACT

Tangle-predominant dementia (TPD) patients exhibit cognitive decline that is clinically similar to early to moderate-stage Alzheimer disease (AD), yet autopsy reveals neurofibrillary tangles in the medial temporal lobe composed of the microtubule-associated protein tau without significant amyloid-beta (Aß)-positive plaques. We performed a series of neuropathological, biochemical and genetic studies using autopsy brain tissue drawn from a cohort of 34 TPD, 50 AD and 56 control subjects to identify molecular and genetic signatures of this entity. Biochemical analysis demonstrates a similar tau protein isoform composition in TPD and AD, which is compatible with previous histological and ultrastructural studies. Further, biochemical analysis fails to uncover elevation of soluble Aß in TPD frontal cortex and hippocampus compared to control subjects, demonstrating that non-plaque-associated Aß is not a contributing factor. Unexpectedly, we also observed high levels of secretory amyloid precursor protein α (sAPPα) in the frontal cortex of some TPD patients compared to AD and control subjects, suggesting differences in APP processing. Finally, we tested whether TPD is associated with changes in the tau gene (MAPT). Haplotype analysis demonstrates a strong association between TPD and the MAPT H1 haplotype, a genomic inversion associated with some tauopathies and Parkinson disease (PD), when compared to age-matched control subjects with mild degenerative changes, i.e., successful cerebral aging. Next-generation resequencing of MAPT followed by association analysis shows an association between TPD and two polymorphisms in the MAPT 3' untranslated region (UTR). These results support the hypothesis that haplotype-specific variation in the MAPT 3' UTR underlies an Aß-independent mechanism for neurodegeneration in TPD.


Subject(s)
Brain/pathology , Dementia/genetics , Dementia/pathology , Genetic Predisposition to Disease , Neurofibrillary Tangles/genetics , tau Proteins/genetics , Aged , Aged, 80 and over , Alzheimer Disease/genetics , Alzheimer Disease/pathology , Amyloid beta-Protein Precursor/metabolism , Amyloidogenic Proteins/metabolism , Brain/metabolism , Brain/ultrastructure , Chi-Square Distribution , Enzyme-Linked Immunosorbent Assay , Female , Gene Frequency , Genetic Association Studies , Haplotypes , Humans , Linkage Disequilibrium , Male , Neurofibrillary Tangles/pathology , Neurofibrillary Tangles/ultrastructure , Severity of Illness Index , tau Proteins/metabolism
20.
Nature ; 440(7083): 497-500, 2006 Mar 23.
Article in English | MEDLINE | ID: mdl-16554811

ABSTRACT

Chromosome 11, although average in size, is one of the most gene- and disease-rich chromosomes in the human genome. Initial gene annotation indicates an average gene density of 11.6 genes per megabase, including 1,524 protein-coding genes, some of which were identified using novel methods, and 765 pseudogenes. One-quarter of the protein-coding genes shows overlap with other genes. Of the 856 olfactory receptor genes in the human genome, more than 40% are located in 28 single- and multi-gene clusters along this chromosome. Out of the 171 disorders currently attributed to the chromosome, 86 remain for which the underlying molecular basis is not yet known, including several mendelian traits, cancer and susceptibility loci. The high-quality data presented here--nearly 134.5 million base pairs representing 99.8% coverage of the euchromatic sequence--provide scientists with a solid foundation for understanding the genetic basis of these disorders and other biological phenomena.


Subject(s)
Chromosomes, Human, Pair 11 , Sequence Analysis, DNA , DNA , Gene Expression , Genes , Humans , Molecular Sequence Data , Physical Chromosome Mapping , Receptors, Odorant/genetics
SELECTION OF CITATIONS
SEARCH DETAIL