Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
1.
Cell ; 178(4): 850-866.e26, 2019 08 08.
Artículo en Inglés | MEDLINE | ID: mdl-31398340

RESUMEN

We performed a comprehensive assessment of rare inherited variation in autism spectrum disorder (ASD) by analyzing whole-genome sequences of 2,308 individuals from families with multiple affected children. We implicate 69 genes in ASD risk, including 24 passing genome-wide Bonferroni correction and 16 new ASD risk genes, most supported by rare inherited variants, a substantial extension of previous findings. Biological pathways enriched for genes harboring inherited variants represent cytoskeletal organization and ion transport, which are distinct from pathways implicated in previous studies. Nevertheless, the de novo and inherited genes contribute to a common protein-protein interaction network. We also identified structural variants (SVs) affecting non-coding regions, implicating recurrent deletions in the promoters of DLG2 and NR3C2. Loss of nr3c2 function in zebrafish disrupts sleep and social function, overlapping with human ASD-related phenotypes. These data support the utility of studying multiplex families in ASD and are available through the Hartwell Autism Research and Technology portal.


Asunto(s)
Trastorno del Espectro Autista/genética , Predisposición Genética a la Enfermedad/genética , Linaje , Mapas de Interacción de Proteínas/genética , Animales , Niño , Bases de Datos Genéticas , Modelos Animales de Enfermedad , Femenino , Eliminación de Gen , Guanilato-Quinasas/genética , Humanos , Patrón de Herencia/genética , Aprendizaje Automático , Masculino , Núcleo Familiar , Regiones Promotoras Genéticas/genética , Receptores de Mineralocorticoides/genética , Factores de Riesgo , Proteínas Supresoras de Tumor/genética , Secuenciación Completa del Genoma , Pez Cebra/genética
2.
Genome Res ; 25(10): 1570-80, 2015 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-26286554

RESUMEN

Although an increasing amount of human genetic variation is being identified and recorded, determining variants within repeated sequences of the human genome remains a challenge. Most population and genome-wide association studies have therefore been unable to consider variation in these regions. Core to the problem is the lack of a sequencing technology that produces reads with sufficient length and accuracy to enable unique mapping. Here, we present a novel methodology of using read clouds, obtained by accurate short-read sequencing of DNA derived from long fragment libraries, to confidently align short reads within repeat regions and enable accurate variant discovery. Our novel algorithm, Random Field Aligner (RFA), captures the relationships among the short reads governed by the long read process via a Markov Random Field. We utilized a modified version of the Illumina TruSeq synthetic long-read protocol, which yielded shallow-sequenced read clouds. We test RFA through extensive simulations and apply it to discover variants on the NA12878 human sample, for which shallow TruSeq read cloud sequencing data are available, and on an invasive breast carcinoma genome that we sequenced using the same method. We demonstrate that RFA facilitates accurate recovery of variation in 155 Mb of the human genome, including 94% of 67 Mb of segmental duplication sequence and 96% of 11 Mb of transcribed sequence, that are currently hidden from short-read technologies.


Asunto(s)
Variación Genética , Genoma Humano , Análisis de Secuencia de ADN/métodos , Algoritmos , Carcinoma Ductal/genética , Carcinoma Ductal de Mama/genética , Fragmentación del ADN , Humanos , Alineación de Secuencia/métodos
3.
Genome Res ; 23(7): 1097-108, 2013 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-23568837

RESUMEN

Cancer evolution involves cycles of genomic damage, epigenetic deregulation, and increased cellular proliferation that eventually culminate in the carcinoma phenotype. Early neoplasias, which are often found concurrently with carcinomas and are histologically distinguishable from normal breast tissue, are less advanced in phenotype than carcinomas and are thought to represent precursor stages. To elucidate their role in cancer evolution we performed comparative whole-genome sequencing of early neoplasias, matched normal tissue, and carcinomas from six patients, for a total of 31 samples. By using somatic mutations as lineage markers we built trees that relate the tissue samples within each patient. On the basis of these lineage trees we inferred the order, timing, and rates of genomic events. In four out of six cases, an early neoplasia and the carcinoma share a mutated common ancestor with recurring aneuploidies, and in all six cases evolution accelerated in the carcinoma lineage. Transition spectra of somatic mutations are stable and consistent across cases, suggesting that accumulation of somatic mutations is a result of increased ancestral cell division rather than specific mutational mechanisms. In contrast to highly advanced tumors that are the focus of much of the current cancer genome sequencing, neither the early neoplasia genomes nor the carcinomas are enriched with potentially functional somatic point mutations. Aneuploidies that occur in common ancestors of neoplastic and tumor cells are the earliest events that affect a large number of genes and may predispose breast tissue to eventual development of invasive carcinoma.


Asunto(s)
Neoplasias de la Mama/genética , Transformación Celular Neoplásica/genética , Genoma Humano , Mutación , Alelos , Aneuploidia , Neoplasias de la Mama/patología , Carcinoma/genética , Carcinoma/patología , Progresión de la Enfermedad , Femenino , Frecuencia de los Genes , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Polimorfismo de Nucleótido Simple
4.
Bioinformatics ; 27(13): i333-41, 2011 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-21685089

RESUMEN

MOTIVATION: Accurate inference of genealogical relationships between pairs of individuals is paramount in association studies, forensics and evolutionary analyses of wildlife populations. Current methods for relationship inference consider only a small set of close relationships and have limited to no power to distinguish between relationships with the same number of meioses separating the individuals under consideration (e.g. aunt-niece versus niece-aunt or first cousins versus great aunt-niece). RESULTS: We present CARROT (ClAssification of Relationships with ROTations), a novel framework for relationship inference that leverages linkage information to differentiate between rotated relationships, that is, between relationships with the same number of common ancestors and the same number of meioses separating the individuals under consideration. We demonstrate that CARROT clearly outperforms existing methods on simulated data. We also applied CARROT on four populations from Phase III of the HapMap Project and detected previously unreported pairs of third- and fourth-degree relatives. AVAILABILITY: Source code for CARROT is freely available at http://carrot.stanford.edu. CONTACT: sofiakp@stanford.edu.


Asunto(s)
Algoritmos , Genealogía y Heráldica , Animales , Humanos , Cadenas de Markov
5.
Pac Symp Biocomput ; 22: 521-532, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-27897003

RESUMEN

Autism has been shown to have a major genetic risk component; the architecture of documented autism in families has been over and again shown to be passed down for generations. While inherited risk plays an important role in the autistic nature of children, de novo (germline) mutations have also been implicated in autism risk. Here we find that autism de novo variants verified and published in the literature are Bonferroni-significantly enriched in a gene set implicated in synaptic elimination. Additionally, several of the genes in this synaptic elimination set that were enriched in protein-protein interactions (CACNA1C, SHANK2, SYNGAP1, NLGN3, NRXN1, and PTEN) have been previously confirmed as genes that confer risk for the disorder. The results demonstrate that autism-associated de novos are linked to proper synaptic pruning and density, hinting at the etiology of autism and suggesting pathophysiology for downstream correction and treatment.


Asunto(s)
Trastorno Autístico/genética , Mutación de Línea Germinal , Trastorno Autístico/patología , Biología Computacional , Bases de Datos Genéticas , Sinapsis Eléctricas/genética , Sinapsis Eléctricas/patología , Femenino , Redes Reguladoras de Genes , Predisposición Genética a la Enfermedad , Humanos , Masculino , Modelos Genéticos , Modelos Neurológicos
6.
Genome Biol ; 16: 91, 2015 May 06.
Artículo en Inglés | MEDLINE | ID: mdl-25944252

RESUMEN

Somatic variants can be used as lineage markers for the phylogenetic reconstruction of cancer evolution. Since somatic phylogenetics is complicated by sample heterogeneity, novel specialized tree-building methods are required for cancer phylogeny reconstruction. We present LICHeE (Lineage Inference for Cancer Heterogeneity and Evolution), a novel method that automates the phylogenetic inference of cancer progression from multiple somatic samples. LICHeE uses variant allele frequencies of somatic single nucleotide variants obtained by deep sequencing to reconstruct multi-sample cell lineage trees and infer the subclonal composition of the samples. LICHeE is open source and available at http://viq854.github.io/lichee .


Asunto(s)
Linaje de la Célula/genética , Variación Genética , Neoplasias/genética , Algoritmos , Carcinoma de Células Renales/genética , Biología Computacional/métodos , Simulación por Computador , Progresión de la Enfermedad , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Neoplasias Renales/genética , Neoplasias Ováricas/genética , Filogenia , Programas Informáticos , Ensayos Antitumor por Modelo de Xenoinjerto
7.
Genome Med ; 7(1): 28, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-25918554

RESUMEN

BACKGROUND: All cells in an individual are related to one another by a bifurcating lineage tree, in which each node is an ancestral cell that divided into two, each branch connects two nodes, and the root is the zygote. When a somatic mutation occurs in an ancestral cell, all its descendants carry the mutation, which can then serve as a lineage marker for the phylogenetic reconstruction of tumor progression. Using this concept, we investigate cell lineage relationships and genetic heterogeneity of pre-invasive neoplasias compared to invasive carcinomas. METHODS: We deeply sequenced over a thousand phylogenetically informative somatic variants in 66 morphologically independent samples from six patients that represent a spectrum of normal, early neoplasia, carcinoma in situ, and invasive carcinoma. For each patient, we obtained a highly resolved lineage tree that establishes the phylogenetic relationships among the pre-invasive lesions and with the invasive carcinoma. RESULTS: The trees reveal lineage heterogeneity of pre-invasive lesions, both within the same lesion, and between histologically similar ones. On the basis of the lineage trees, we identified a large number of independent recurrences of PIK3CA H1047 mutations in separate lesions in four of the six patients, often separate from the diagnostic carcinoma. CONCLUSIONS: Our analyses demonstrate that multi-sample phylogenetic inference provides insights on the origin of driver mutations, lineage heterogeneity of neoplastic proliferations, and the relationship of genomically aberrant neoplasias with the primary tumors. PIK3CA driver mutations may be comparatively benign inducers of cellular proliferation.

8.
J Comput Biol ; 20(11): 933-44, 2013 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-24195709

RESUMEN

Next-generation sequencing technologies provide a powerful tool for studying genome evolution during progression of advanced diseases such as cancer. Although many recent studies have employed new sequencing technologies to detect mutations across multiple, genetically related tumors, current methods do not exploit available phylogenetic information to improve the accuracy of their variant calls. Here, we present a novel algorithm that uses somatic single-nucleotide variations (SNVs) in multiple, related tissue samples as lineage markers for phylogenetic tree reconstruction. Our method then leverages the inferred phylogeny to improve the accuracy of SNV discovery. Experimental analyses demonstrate that our method achieves up to 32% improvement for somatic SNV calling of multiple, related samples over the accuracy of GATK's Unified Genotyper, the state-of-the-art multisample SNV caller.


Asunto(s)
Análisis Mutacional de ADN , Neoplasias/genética , Polimorfismo de Nucleótido Simple , Algoritmos , Simulación por Computador , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Modelos Genéticos , Mutación , Filogenia
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA