RESUMO
Schizophrenia is a common disease with a complex aetiology, probably involving multiple and heterogeneous genetic factors. Here, by analysing the exome sequences of 2,536 schizophrenia cases and 2,543 controls, we demonstrate a polygenic burden primarily arising from rare (less than 1 in 10,000), disruptive mutations distributed across many genes. Particularly enriched gene sets include the voltage-gated calcium ion channel and the signalling complex formed by the activity-regulated cytoskeleton-associated scaffold protein (ARC) of the postsynaptic density, sets previously implicated by genome-wide association and copy-number variation studies. Similar to reports in autism, targets of the fragile X mental retardation protein (FMRP, product of FMR1) are enriched for case mutations. No individual gene-based test achieves significance after correction for multiple testing and we do not detect any alleles of moderately low frequency (approximately 0.5 to 1 per cent) and moderately large effect. Taken together, these data suggest that population-based exome sequencing can discover risk alleles and complements established gene-mapping paradigms in neuropsychiatric disease.
Assuntos
Herança Multifatorial/genética , Mutação/genética , Esquizofrenia/genética , Transtorno Autístico/genética , Canais de Cálcio/genética , Proteínas do Citoesqueleto/genética , Variações do Número de Cópias de DNA/genética , Proteína 4 Homóloga a Disks-Large , Feminino , Proteína do X Frágil da Deficiência Intelectual/metabolismo , Estudo de Associação Genômica Ampla , Humanos , Deficiência Intelectual/genética , Peptídeos e Proteínas de Sinalização Intracelular/genética , Masculino , Proteínas de Membrana/genética , Proteínas do Tecido Nervoso/genética , Receptores de N-Metil-D-Aspartato/genéticaRESUMO
[3 H]Genipin was synthesized in a single step by Ir(I) catalyzed hydrogen isotope exchange. Conditions for selective exchange of the sp2 CH bond ortho to the methyl ester functionality were developed through deuterium modeling studies through a catalyst screen. Optimized conditions so obtained were then utilized with tritium gas to generate [3 H]genipin at a specific activity of 18.5 Ci/mmol. Racemic [14 C]genipin was prepared in eight steps in overall 5.4% radiochemical yield from potassium [14 C]cyanide.
Assuntos
Radioisótopos de Carbono/química , Iridoides/química , Iridoides/síntese química , Trítio/química , Catálise , Técnicas de Química Sintética , Irídio/química , Marcação por Isótopo , RadioquímicaRESUMO
Autism spectrum disorders (ASD) are believed to have genetic and environmental origins, yet in only a modest fraction of individuals can specific causes be identified. To identify further genetic risk factors, here we assess the role of de novo mutations in ASD by sequencing the exomes of ASD cases and their parents (n = 175 trios). Fewer than half of the cases (46.3%) carry a missense or nonsense de novo variant, and the overall rate of mutation is only modestly higher than the expected rate. In contrast, the proteins encoded by genes that harboured de novo missense or nonsense mutations showed a higher degree of connectivity among themselves and to previous ASD genes as indexed by protein-protein interaction screens. The small increase in the rate of de novo events, when taken together with the protein interaction results, are consistent with an important but limited role for de novo point mutations in ASD, similar to that documented for de novo copy number variants. Genetic models incorporating these data indicate that most of the observed de novo events are unconnected to ASD; those that do confer risk are distributed across many genes and are incompletely penetrant (that is, not necessarily sufficient for disease). Our results support polygenic models in which spontaneous coding mutations in any of a large number of genes increases risk by 5- to 20-fold. Despite the challenge posed by such models, results from de novo events and a large parallel case-control study provide strong evidence in favour of CHD8 and KATNAL2 as genuine autism risk factors.
Assuntos
Transtorno Autístico/genética , Proteínas de Ligação a DNA/genética , Éxons/genética , Predisposição Genética para Doença/genética , Mutação/genética , Fatores de Transcrição/genética , Estudos de Casos e Controles , Exoma/genética , Saúde da Família , Humanos , Modelos Genéticos , Herança Multifatorial/genética , Fenótipo , Distribuição de Poisson , Mapas de Interação de ProteínasRESUMO
We sequenced all protein-coding regions of the genome (the "exome") in two family members with combined hypolipidemia, marked by extremely low plasma levels of low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, and triglycerides. These two participants were compound heterozygotes for two distinct nonsense mutations in ANGPTL3 (encoding the angiopoietin-like 3 protein). ANGPTL3 has been reported to inhibit lipoprotein lipase and endothelial lipase, thereby increasing plasma triglyceride and HDL cholesterol levels in rodents. Our finding of ANGPTL3 mutations highlights a role for the gene in LDL cholesterol metabolism in humans and shows the usefulness of exome sequencing for identification of novel genetic causes of inherited disorders. (Funded by the National Human Genome Research Institute and others.).
Assuntos
Angiopoietinas/genética , Códon sem Sentido , Hipobetalipoproteinemias/genética , Proteína 3 Semelhante a Angiopoietina , Proteínas Semelhantes a Angiopoietina , HDL-Colesterol/sangue , HDL-Colesterol/genética , LDL-Colesterol/sangue , LDL-Colesterol/genética , Análise Mutacional de DNA , Feminino , Ligação Genética , Humanos , Masculino , LinhagemRESUMO
SUMMARY: Here, we present ContEst, a tool for estimating the level of cross-individual contamination in next-generation sequencing data. We demonstrate the accuracy of ContEst across a range of contamination levels, sources and read depths using sequencing data mixed in silico at known concentrations. We applied our tool to published cancer sequencing datasets and report their estimated contamination levels. AVAILABILITY AND IMPLEMENTATION: ContEst is a GATK module, and distributed under a BSD style license at http://www.broadinstitute.org/cancer/cga/contest CONTACT: kcibul@broadinstitute.org; gadgetz@broadinstitute.org SUPPLEMENTARY INFORMATION: Supplementary data is available at Bioinformatics online.
Assuntos
Neoplasias/genética , Análise de Sequência de DNA/métodos , Sequência de Bases , Teorema de Bayes , Reações Falso-Positivas , Genótipo , Humanos , Modelos Genéticos , SoftwareRESUMO
Forensic genetic genealogy (FGG) has primarily relied upon dense single nucleotide polymorphism (SNP) profiles from forensic samples or unidentified human remains queried against online genealogy database(s) of known profiles generated with SNP microarrays or from whole genome sequencing (WGS). In these queries, SNPs are compared to database samples by locating contiguous stretches of shared SNP alleles that allow for detection of genomic segments that are identical by descent (IBD) among biological relatives (kinship). This segment-based approach, while robust for detecting distant relationships, generally requires DNA quantity and/or quality that are sometimes not available in forensic casework samples. By focusing on SNPs with maximal discriminatory power and using an algorithm designed for a sparser SNP set than those from microarray typing, performance similar to segment matching was reached even in difficult casework samples. This algorithm locates shared segments using kinship coefficients in "windows" across the genome. The windowed kinship algorithm is a modification of the PC-AiR and PC-Relate tools for genetic relatedness inference, referred to here as the "whole genome kinship" approach, that control for the presence of unknown or unspecified population substructure. Simulated and empirical data in this study, using DNA profiles comprised of 10,230 SNPs (10K multiplex) targeted by the ForenSeq™ Kintelligence Kit demonstrate that the windowed kinship approach performs comparably to segment matching for identifying first, second and third degree relationships, reasonably well for fourth degree relationships, and with fewer false kinship associations. Selection criteria for the 10K SNP PCR-based multiplex and functionality of the windowed kinship algorithm are described.
Assuntos
Impressões Digitais de DNA , Polimorfismo de Nucleotídeo Único , Humanos , Linhagem , Alelos , Reação em Cadeia da PolimeraseRESUMO
Oncogenic activation of tyrosine kinases is a common mechanism of carcinogenesis and, given the druggable nature of these enzymes, an attractive target for anticancer therapy. Here, we show that somatic mutations of the fibroblast growth factor receptor 2 (FGFR2) tyrosine kinase gene, FGFR2, are present in 12% of endometrial carcinomas, with additional instances found in lung squamous cell carcinoma and cervical carcinoma. These FGFR2 mutations, many of which are identical to mutations associated with congenital craniofacial developmental disorders, are constitutively activated and oncogenic when ectopically expressed in NIH 3T3 cells. Inhibition of FGFR2 kinase activity in endometrial carcinoma cell lines bearing such FGFR2 mutations inhibits transformation and survival, implicating FGFR2 as a novel therapeutic target in endometrial carcinoma.
Assuntos
Carcinoma/genética , Neoplasias do Endométrio/genética , Mutação , Receptor Tipo 2 de Fator de Crescimento de Fibroblastos/genética , Animais , Carcinoma/tratamento farmacológico , Carcinoma/metabolismo , Linhagem Celular Tumoral , Proliferação de Células , Sobrevivência Celular , Neoplasias do Endométrio/tratamento farmacológico , Neoplasias do Endométrio/metabolismo , Feminino , Camundongos , Células NIH 3T3 , Receptor Tipo 2 de Fator de Crescimento de Fibroblastos/antagonistas & inibidores , Receptor Tipo 2 de Fator de Crescimento de Fibroblastos/metabolismo , TransfecçãoRESUMO
We describe CALITAS, a CRISPR-Cas-aware aligner and integrated off-target search algorithm. CALITAS uses a modified and CRISPR-tuned version of the Needleman-Wunsch algorithm. It supports an unlimited number of mismatches and gaps and allows protospacer adjacent motif (PAM) mismatches or PAMless searches. CALITAS also includes an exhaustive search routine to scan genomes and genome variants provided with a standard Variant Call Format file. By default, CALITAS returns a single best alignment for a given off-target site, which is a significant improvement compared to other off-target algorithms, and it enables off-targets to be referenced directly using alignment coordinates. We validate and compare CALITAS using a selected set of target sites, as well as experimentally derived specificity data sets. In summary, CALITAS is a new tool for precise and relevant alignments and identification of candidate off-target sites across a genome. We believe it is the state of the art for CRISPR-Cas specificity assessments.
Assuntos
Sistemas CRISPR-Cas , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Simulação por Computador , Endonucleases/genética , Algoritmos , Proteínas de Bactérias , Proteína 9 Associada à CRISPR , Proteínas Associadas a CRISPR , Endodesoxirribonucleases , Edição de Genes , Genoma , RNA Guia de Cinetoplastídeos/genética , SoftwareRESUMO
SUMMARY: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. AVAILABILITY: http://samtools.sourceforge.net.
Assuntos
Biologia Computacional/métodos , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Software , Algoritmos , Sequência de Bases , Genoma , Genômica , Dados de Sequência MolecularRESUMO
BACKGROUND: Forward genetic screens in mice provide an unbiased means to identify genes and other functional genetic elements in the genome. Previously, a large scale ENU mutagenesis screen was conducted to query the functional content of a ~50 Mb region of the mouse genome on proximal Chr 5. The majority of phenotypic mutants recovered were embryonic lethals. RESULTS: We report the high resolution genetic mapping, complementation analyses, and positional cloning of mutations in the target region. The collection of identified alleles include several with known or presumed functions for which no mutant models have been reported (Tbc1d14, Nol14, Tyms, Cad, Fbxl5, Haus3), and mutations in genes we or others previously reported (Tapt1, Rest, Ugdh, Paxip1, Hmx1, Otoe, Nsun7). We also confirmed the causative nature of a homeotic mutation with a targeted allele, mapped a lethal mutation to a large gene desert, and localized a spermiogenesis mutation to a region in which no annotated genes have coding mutations. The mutation in Tbc1d14 provides the first implication of a critical developmental role for RAB-GAP-mediated protein transport in early embryogenesis. CONCLUSION: This collection of alleles contributes to the goal of assigning biological functions to all known genes, as well as identifying novel functional elements that would be missed by reverse genetic approaches.
Assuntos
Mapeamento Cromossômico , Cromossomos/efeitos dos fármacos , Análise Mutacional de DNA , Desenvolvimento Embrionário/genética , Camundongos/genética , Mutação , Animais , Clonagem Molecular , Etilnitrosoureia/toxicidade , Genes Letais , Teste de Complementação Genética , Masculino , Camundongos Endogâmicos C57BL , Deleção de Sequência , Espermatogênese/genéticaRESUMO
As the number of genomics datasets grows rapidly, sample mislabeling has become a high stakes issue. We present CrosscheckFingerprints (Crosscheck), a tool for quantifying sample-relatedness and detecting incorrectly paired sequencing datasets from different donors. Crosscheck outperforms similar methods and is effective even when data are sparse or from different assays. Application of Crosscheck to 8851 ENCODE ChIP-, RNA-, and DNase-seq datasets enabled us to identify and correct dozens of mislabeled samples and ambiguous metadata annotations, representing ~1% of ENCODE datasets.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Desequilíbrio de Ligação/genética , Bases de Dados de Ácidos Nucleicos , Genótipo , Células HEK293 , Células Endoteliais da Veia Umbilical Humana/metabolismo , Humanos , Células K562 , Escore Lod , Anotação de Sequência MolecularRESUMO
Leber congenital amaurosis type 10 is a severe retinal dystrophy caused by mutations in the CEP290 gene1,2. We developed EDIT-101, a candidate genome-editing therapeutic, to remove the aberrant splice donor created by the IVS26 mutation in the CEP290 gene and restore normal CEP290 expression. Key to this therapeutic, we identified a pair of Staphylococcus aureus Cas9 guide RNAs that were highly active and specific to the human CEP290 target sequence. In vitro experiments in human cells and retinal explants demonstrated the molecular mechanism of action and nuclease specificity. Subretinal delivery of EDIT-101 in humanized CEP290 mice showed rapid and sustained CEP290 gene editing. A comparable surrogate non-human primate (NHP) vector also achieved productive editing of the NHP CEP290 gene at levels that met the target therapeutic threshold, and demonstrated the ability of CRISPR/Cas9 to edit somatic primate cells in vivo. These results support further development of EDIT-101 for LCA10 and additional CRISPR-based medicines for other inherited retinal disorders.
Assuntos
Edição de Genes , Amaurose Congênita de Leber/genética , Amaurose Congênita de Leber/fisiopatologia , Animais , Linhagem Celular , Técnicas de Introdução de Genes , Humanos , Camundongos , Primatas , Reprodutibilidade dos Testes , Visão OcularRESUMO
Antagonists of the CB1 receptor can be useful in the treatment of several important disorders. However, to date, the only clinically approved CB1 receptor antagonist, rimonabant, was withdrawn because of adverse central nervous system (CNS)-related side effects. Since rimonabant's withdrawal, several groups are pursuing peripherally selective CB1 antagonists. These compounds are expected to be devoid of undesirable CNS-related effects but maintain efficacy through antagonism of peripherally expressed CB1 receptors. Reported here are our latest results toward the development of a peripherally selective analog of the diphenyl purine CB1 antagonist otenabant 1. Compound 9 (N-{1-[8-(2-chlorophenyl)-9-(4-chlorophenyl)-9H-purin-6-yl]piperidin-4-yl}pentanamide) is a potent, orally absorbed antagonist of the CB1 receptor that is >50-fold selective for CB1 over CB2, highly selective for the periphery in a rodent model, and without efficacy in a series of in vivo assays designed to evaluate its ability to mitigate the central effects of Δ(9)-tetrahydrocannabinol through the CB1 receptor.
Assuntos
Compostos de Bifenilo/farmacologia , Purinas/farmacologia , Receptor CB1 de Canabinoide/antagonistas & inibidores , Receptor CB2 de Canabinoide/antagonistas & inibidores , Animais , Área Sob a Curva , Ligação Competitiva , Compostos de Bifenilo/química , Compostos de Bifenilo/farmacocinética , Temperatura Corporal/efeitos dos fármacos , Masculino , Camundongos , Camundongos Endogâmicos ICR , Estrutura Molecular , Atividade Motora/efeitos dos fármacos , Piperidinas/química , Piperidinas/farmacocinética , Piperidinas/farmacologia , Purinas/química , Purinas/farmacocinética , Ratos , Ratos Sprague-Dawley , Receptor CB1 de Canabinoide/metabolismo , Receptor CB2 de Canabinoide/metabolismoRESUMO
Cannabinoid receptor 1 (CB1) antagonists are potentially useful for the treatment of several diseases. However, clinical development of several CB1 antagonists was halted due to central nervous system (CNS)-related side effects including depression and suicidal ideation in some users. Recently, studies have indicated that selective regulation of CB1 receptors in the periphery is a viable strategy for treating several important disorders. Past efforts to develop peripherally selective antagonists of CB1 have largely targeted rimonabant, an inverse agonist of CB1. Reported here are our efforts toward developing a peripherally selective CB1 antagonist based on the otenabant scaffold. Even though otenabant penetrates the CNS, it is unique among CB1 antagonists that have been clinically tested because it has properties that are normally associated with peripherally selective compounds. Our efforts have resulted in an orally absorbed compound that is a potent and selective CB1 antagonist with limited penetration into the CNS.
Assuntos
Compostos de Bifenilo/química , Encéfalo/efeitos dos fármacos , Antagonistas de Receptores de Canabinoides/farmacologia , Desenho de Fármacos , Purinas/química , Receptor CB1 de Canabinoide/antagonistas & inibidores , Sulfonamidas/farmacologia , Membro 1 da Subfamília B de Cassetes de Ligação de ATP/metabolismo , Animais , Cálcio/metabolismo , Antagonistas de Receptores de Canabinoides/síntese química , Permeabilidade da Membrana Celular/efeitos dos fármacos , Células Cultivadas , Cães , Masculino , Modelos Moleculares , Estrutura Molecular , Purinas/síntese química , Purinas/farmacologia , Ensaio Radioligante , Ratos , Ratos Sprague-Dawley , Receptor CB1 de Canabinoide/metabolismo , Relação Estrutura-Atividade , Sulfonamidas/síntese químicaRESUMO
Antagonists of cannabinoid receptor 1 (CB1) have potential for the treatment of several diseases such as obesity, liver disease, and diabetes. Recently, development of several CB1 antagonists was halted because of adverse central nervous system (CNS) related side effects observed with rimonabant, the first clinically approved CB1 inverse agonist. However, recent studies indicate that regulation of peripherally expressed CB1 with CNS-sparing compounds is a viable strategy to treat several important disorders. Our efforts aimed at rationally designing peripherally restricted CB1 antagonists have resulted in compounds that have limited blood-brain barrier (BBB) permeability and CNS exposure in preclinical in vitro and in vivo models. Typically, compounds with high topological polar surface areas (TPSAs) do not cross the BBB passively. Compounds with TPSAs higher than that for rimonabant (rimonabant TPSA = 50) and excellent functional activity with limited CNS penetration were identified. These compounds will serve as templates for further optimization.
Assuntos
Piperidinas/síntese química , Pirazóis/síntese química , Receptor CB1 de Canabinoide/antagonistas & inibidores , Ureia/análogos & derivados , Ureia/síntese química , Animais , Barreira Hematoencefálica/metabolismo , Cálcio/metabolismo , Linhagem Celular , Permeabilidade da Membrana Celular , Cães , Desenho de Fármacos , Humanos , Ligantes , Masculino , Piperidinas/química , Piperidinas/farmacologia , Pirazóis/química , Pirazóis/farmacologia , Ensaio Radioligante , Ratos , Ratos Sprague-Dawley , Estereoisomerismo , Relação Estrutura-Atividade , Ureia/química , Ureia/farmacologiaRESUMO
Recent advances in sequencing technology make it possible to comprehensively catalog genetic variation in population samples, creating a foundation for understanding human disease, ancestry and evolution. The amounts of raw data produced are prodigious, and many computational steps are required to translate this output into high-quality variant calls. We present a unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs. Our process includes (i) initial read mapping; (ii) local realignment around indels; (iii) base quality score recalibration; (iv) SNP discovery and genotyping to find all potential variants; and (v) machine learning to separate true segregating variation from machine artifacts common to next-generation sequencing technologies. We here discuss the application of these tools, instantiated in the Genome Analysis Toolkit, to deep whole-genome, whole-exome capture and multi-sample low-pass (â¼4×) 1000 Genomes Project datasets.
Assuntos
Variação Genética , Genótipo , Análise de Sequência de DNA/métodos , Interpretação Estatística de Dados , Bases de Dados de Ácidos Nucleicos , Éxons , Genética Populacional/métodos , Genética Populacional/estatística & dados numéricos , Genoma Humano , Humanos , Polimorfismo de Nucleotídeo Único , Alinhamento de Sequência/métodos , Alinhamento de Sequência/estatística & dados numéricos , Análise de Sequência de DNA/estatística & dados numéricos , SoftwareRESUMO
More than 1,000 susceptibility loci have been identified through genome-wide association studies (GWAS) of common variants; however, the specific genes and full allelic spectrum of causal variants underlying these findings have not yet been defined. Here we used pooled next-generation sequencing to study 56 genes from regions associated with Crohn's disease in 350 cases and 350 controls. Through follow-up genotyping of 70 rare and low-frequency protein-altering variants in nine independent case-control series (16,054 Crohn's disease cases, 12,153 ulcerative colitis cases and 17,575 healthy controls), we identified four additional independent risk factors in NOD2, two additional protective variants in IL23R, a highly significant association with a protective splice variant in CARD9 (P < 1 × 10(-16), odds ratio ≈ 0.29) and additional associations with coding variants in IL18RAP, CUL2, C1orf106, PTPN22 and MUC19. We extend the results of successful GWAS by identifying new, rare and probably functional variants that could aid functional experiments and predictive models.
Assuntos
Estudo de Associação Genômica Ampla , Doenças Inflamatórias Intestinais/genética , Análise de Sequência de DNA , Estudos de Casos e Controles , Linhagem Celular , Predisposição Genética para Doença , Humanos , Proteína Adaptadora de Sinalização NOD2/genética , Splicing de RNA , Receptores de Interleucina/genéticaRESUMO
This unit describes a protocol for the targeted enrichment of exons from randomly sheared genomic DNA libraries using an in-solution hybrid selection approach for sequencing on an Illumina Genome Analyzer II. The steps for designing and ordering a hybrid selection oligo pool are reviewed, as are critical steps for performing the preparation and hybrid selection of an Illumina paired-end library. Critical parameters, performance metrics, and analysis workflow are discussed.
Assuntos
Éxons/genética , Hibridização de Ácido Nucleico/métodos , Análise de Sequência de DNA/métodos , Humanos , SoluçõesRESUMO
Soft-tissue sarcomas, which result in approximately 10,700 diagnoses and 3,800 deaths per year in the United States, show remarkable histologic diversity, with more than 50 recognized subtypes. However, knowledge of their genomic alterations is limited. We describe an integrative analysis of DNA sequence, copy number and mRNA expression in 207 samples encompassing seven major subtypes. Frequently mutated genes included TP53 (17% of pleomorphic liposarcomas), NF1 (10.5% of myxofibrosarcomas and 8% of pleomorphic liposarcomas) and PIK3CA (18% of myxoid/round-cell liposarcomas, or MRCs). PIK3CA mutations in MRCs were associated with Akt activation and poor clinical outcomes. In myxofibrosarcomas and pleomorphic liposarcomas, we found both point mutations and genomic deletions affecting the tumor suppressor NF1. Finally, we found that short hairpin RNA (shRNA)-based knockdown of several genes amplified in dedifferentiated liposarcoma, including CDK4 and YEATS4, decreased cell proliferation. Our study yields a detailed map of molecular alterations across diverse sarcoma subtypes and suggests potential subtype-specific targets for therapy.