1.
Cell ; 185(18): 3426-3440.e19, 2022 09 01.
Article in English | MEDLINE | ID: mdl-36055201

ABSTRACT

The 1000 Genomes Project (1kGP) is the largest fully open resource of whole-genome sequencing (WGS) data consented for public distribution without access or use restrictions. The final, phase 3 release of the 1kGP included 2,504 unrelated samples from 26 populations and was based primarily on low-coverage WGS. Here, we present a high-coverage 3,202-sample WGS 1kGP resource, which now includes 602 complete trios, sequenced to a depth of 30X using Illumina. We performed single-nucleotide variant (SNV) and short insertion and deletion (INDEL) discovery and generated a comprehensive set of structural variants (SVs) by integrating multiple analytic methods through a machine learning model. We show gains in sensitivity and precision of variant calls compared to phase 3, especially among rare SNVs as well as INDELs and SVs spanning the frequency spectrum. We also generated an improved reference imputation panel, making variants discovered here accessible for association studies.


Subjects
Human Genome , Whole Genome Sequencing , Female , High-Throughput Nucleotide Sequencing/methods , Humans , INDEL Mutation , Male , Single Nucleotide Polymorphism
2.
Genome Res ; 27(11): 1895-1903, 2017 11.
Article in English | MEDLINE | ID: mdl-28887402

ABSTRACT

Identifying large expansions of short tandem repeats (STRs), such as those that cause amyotrophic lateral sclerosis (ALS) and fragile X syndrome, is challenging for short-read whole-genome sequencing (WGS) data. A solution to this problem is an important step toward integrating WGS into precision medicine. We developed a software tool called ExpansionHunter that, using PCR-free WGS short-read data, can genotype repeats at the locus of interest, even if the expanded repeat is larger than the read length. We applied our algorithm to WGS data from 3001 ALS patients who have been tested for the presence of the C9orf72 repeat expansion with repeat-primed PCR (RP-PCR). Compared against this truth data, ExpansionHunter correctly classified all (212/212, 95% CI [0.98, 1.00]) of the expanded samples as either expansions (208) or potential expansions (4). Additionally, 99.9% (2786/2789, 95% CI [0.997, 1.00]) of the wild-type samples were correctly classified as wild type by this method with the remaining three samples identified as possible expansions. We further applied our algorithm to a set of 152 samples in which every sample had one of eight different pathogenic repeat expansions, including those associated with fragile X syndrome, Friedreich's ataxia, and Huntington's disease, and correctly flagged all but one of the known repeat expansions. Thus, ExpansionHunter can be used to accurately detect known pathogenic repeat expansions and provides researchers with a tool that can be used to identify new pathogenic repeat expansions.
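The confidence intervals quoted in this abstract can be checked with a short calculation. The sketch below assumes the exact (Clopper-Pearson) binomial interval; the abstract does not say which method the authors used, so that choice, along with the function names `binom_tail` and `clopper_pearson`, is an illustrative assumption rather than the paper's procedure.

```python
import math


def binom_tail(first, last, n, p):
    """Sum of Binomial(n, p) probabilities for outcomes first..last (inclusive)."""
    if p <= 0.0:
        return 1.0 if first <= 0 <= last else 0.0
    if p >= 1.0:
        return 1.0 if first <= n <= last else 0.0
    total = 0.0
    for i in range(first, last + 1):
        # Work in log space so large n does not overflow or underflow.
        log_term = (math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
                    + i * math.log(p) + (n - i) * math.log1p(-p))
        total += math.exp(log_term)
    return min(total, 1.0)


def clopper_pearson(x, n, alpha=0.05):
    """Exact two-sided (1 - alpha) interval for x successes in n trials.

    Assumption: this is one common choice of interval, not necessarily
    the one used in the abstract above.
    """
    target = alpha / 2.0

    # Lower bound: the p at which P(X >= x) equals alpha/2 (increasing in p).
    if x == 0:
        lower = 0.0
    else:
        lo, hi = 0.0, 1.0
        for _ in range(60):
            mid = (lo + hi) / 2.0
            if binom_tail(x, n, n, mid) < target:
                lo = mid
            else:
                hi = mid
        lower = (lo + hi) / 2.0

    # Upper bound: the p at which P(X <= x) equals alpha/2 (decreasing in p).
    if x == n:
        upper = 1.0
    else:
        lo, hi = 0.0, 1.0
        for _ in range(60):
            mid = (lo + hi) / 2.0
            if binom_tail(0, x, n, mid) > target:
                lo = mid
            else:
                hi = mid
        upper = (lo + hi) / 2.0

    return lower, upper


if __name__ == "__main__":
    for successes, trials in [(212, 212), (2786, 2789)]:
        low, high = clopper_pearson(successes, trials)
        print(f"{successes}/{trials}: [{low:.4f}, {high:.4f}]")
```

When every trial succeeds, as in 212/212, the lower bound has the closed form (alpha/2)^(1/n), which rounds to 0.98 and agrees with the interval reported in the abstract.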


Subjects
Amyotrophic Lateral Sclerosis/genetics , DNA Repeat Expansion , Whole Genome Sequencing/methods , Algorithms , C9orf72 Protein/genetics , Genetic Databases , Humans , Precision Medicine , Sensitivity and Specificity , Software
3.
PLoS Genet ; 11(12): e1005698, 2015 Dec.
Article in English | MEDLINE | ID: mdl-26641248

ABSTRACT

In Caenorhabditis elegans, the dosage compensation complex (DCC) specifically binds to and represses transcription from both X chromosomes in hermaphrodites. The DCC is composed of an X-specific condensin complex that interacts with several proteins. During embryogenesis, DCC starts localizing to the X chromosomes around the 40-cell stage, and is followed by X-enrichment of H4K20me1 between 100-cell to comma stage. Here, we analyzed dosage compensation of the X chromosome between sexes, and the roles of dpy-27 (condensin subunit), dpy-21 (non-condensin DCC member), set-1 (H4K20 monomethylase) and set-4 (H4K20 di-/tri-methylase) in X chromosome repression using mRNA-seq and ChIP-seq analyses across several developmental time points. We found that the DCC starts repressing the X chromosomes by the 40-cell stage, but X-linked transcript levels remain significantly higher in hermaphrodites compared to males through the comma stage of embryogenesis. Dpy-27 and dpy-21 are required for X chromosome repression throughout development, but particularly in early embryos dpy-27 and dpy-21 mutations produced distinct expression changes, suggesting a DCC independent role for dpy-21. We previously hypothesized that the DCC increases H4K20me1 by reducing set-4 activity on the X chromosomes. Accordingly, in the set-4 mutant, H4K20me1 increased more from the autosomes compared to the X, equalizing H4K20me1 level between X and autosomes. H4K20me1 increase on the autosomes led to a slight repression, resulting in a relative effect of X derepression. H4K20me1 depletion in the set-1 mutant showed greater X derepression compared to equalization of H4K20me1 levels between X and autosomes in the set-4 mutant, indicating that H4K20me1 level is important, but X to autosomal balance of H4K20me1 contributes slightly to X-repression. 
Thus H4K20me1 is not only a downstream effector of the DCC [corrected]. In summary, X chromosome dosage compensation starts in early embryos as the DCC localizes to the X, and is strengthened in later embryogenesis by H4K20me1.


Subjects
Caenorhabditis elegans Proteins/genetics , Carrier Proteins/genetics , Genetic Dosage Compensation , Embryonic Development , Histone-Lysine N-Methyltransferase/genetics , Nuclear Proteins/genetics , Animals , Caenorhabditis elegans , Chromatin/genetics , Female , Male , Mutation , X Chromosome/genetics
5.
Nat Med ; 30(6): 1655-1666, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38877116

ABSTRACT

In solid tumor oncology, circulating tumor DNA (ctDNA) is poised to transform care through accurate assessment of minimal residual disease (MRD) and therapeutic response monitoring. To overcome the sparsity of ctDNA fragments in low tumor fraction (TF) settings and increase MRD sensitivity, we previously leveraged genome-wide mutational integration through plasma whole-genome sequencing (WGS). Here we now introduce MRD-EDGE, a machine-learning-guided WGS ctDNA single-nucleotide variant (SNV) and copy-number variant (CNV) detection platform designed to increase signal enrichment. MRD-EDGE^SNV uses deep learning and a ctDNA-specific feature space to increase SNV signal-to-noise enrichment in WGS by ~300× compared to previous WGS error suppression. MRD-EDGE^CNV also reduces the degree of aneuploidy needed for ultrasensitive CNV detection through WGS from 1 Gb to 200 Mb, vastly expanding its applicability within solid tumors. We harness the improved performance to identify MRD following surgery in multiple cancer types, track changes in TF in response to neoadjuvant immunotherapy in lung cancer and demonstrate ctDNA shedding in precancerous colorectal adenomas. Finally, the radical signal-to-noise enrichment in MRD-EDGE^SNV enables plasma-only (non-tumor-informed) disease monitoring in advanced melanoma and lung cancer, yielding clinically informative TF monitoring for patients on immune-checkpoint inhibition.


Subjects
Circulating Tumor DNA , DNA Copy Number Variations , Machine Learning , Residual Neoplasm , Tumor Burden , Humans , Circulating Tumor DNA/genetics , Circulating Tumor DNA/blood , Residual Neoplasm/genetics , Whole Genome Sequencing , Neoplasms/genetics , Neoplasms/blood , Neoplasms/therapy , Neoplasms/pathology , Single Nucleotide Polymorphism , Tumor Biomarkers/genetics , Tumor Biomarkers/blood , Colorectal Neoplasms/genetics , Colorectal Neoplasms/blood , Colorectal Neoplasms/pathology , Lung Neoplasms/genetics , Lung Neoplasms/blood , Lung Neoplasms/pathology
6.
NPJ Precis Oncol ; 7(1): 91, 2023 Sep 13.
Article in English | MEDLINE | ID: mdl-37704749

ABSTRACT

Intracranial metastases in prostate cancer are uncommon but clinically aggressive. A detailed molecular characterization of prostate cancer intracranial metastases would improve our understanding of their pathogenesis and the search for new treatment strategies. We evaluated the clinical and molecular characteristics of 36 patients with metastatic prostate cancer to either the dura or brain parenchyma. We performed whole genome sequencing (WGS) of 10 intracranial prostate cancer metastases, as well as WGS of primary prostate tumors from men who later developed metastatic disease (n = 6) and nonbrain prostate cancer metastases (n = 36). This first whole genome sequencing study of prostate intracranial metastases led to several new insights. First, there was a higher diversity of complex structural alterations in prostate cancer intracranial metastases compared to primary tumor tissues. Chromothripsis and chromoplexy events seemed to dominate, yet there were few enrichments of specific categories of structural variants compared with non-brain metastases. Second, aberrations involving the AR gene, including AR enhancer gain were observed in 7/10 (70%) of intracranial metastases, as well as recurrent loss of function aberrations involving TP53 in 8/10 (80%), RB1 in 2/10 (20%), BRCA2 in 2/10 (20%), and activation of the PI3K/AKT/PTEN pathway in 8/10 (80%). These alterations were frequently present in tumor tissues from other sites of disease obtained concurrently or sequentially from the same individuals. Third, clonality analysis points to genomic factors and evolutionary bottlenecks that contribute to metastatic spread in patients with prostate cancer. These results describe the aggressive molecular features underlying intracranial metastasis that may inform future diagnostic and treatment approaches.

7.
Elife ; 11, 2022 Nov 04.
Article in English | MEDLINE | ID: mdl-36331876

ABSTRACT

Condensins are molecular motors that compact DNA via linear translocation. In Caenorhabditis elegans, the X-chromosome harbors a specialized condensin that participates in dosage compensation (DC). Condensin DC is recruited to and spreads from a small number of recruitment elements on the X-chromosome (rex) and is required for the formation of topologically associating domains (TADs). We take advantage of autosomes that are largely devoid of condensin DC and TADs to address how rex sites and condensin DC give rise to the formation of TADs. When an autosome and X-chromosome are physically fused, despite the spreading of condensin DC into the autosome, no TAD was created. Insertion of a strong rex on the X-chromosome results in the TAD boundary formation regardless of sequence orientation. When the same rex is inserted on an autosome, despite condensin DC recruitment, there was no spreading or features of a TAD. On the other hand, when a 'super rex' composed of six rex sites or three separate rex sites are inserted on an autosome, recruitment and spreading of condensin DC led to the formation of TADs. Therefore, recruitment to and spreading from rex sites are necessary and sufficient for recapitulating loop-anchored TADs observed on the X-chromosome. Together our data suggest a model in which rex sites are both loading sites and bidirectional barriers for condensin DC, a one-sided loop-extruder with movable inactive anchor.


Subjects
Caenorhabditis elegans , Gene Expression Regulation , Animals , Caenorhabditis elegans/genetics , Genetic Dosage Compensation , X Chromosome/genetics
8.
Commun Biol ; 4(1): 1026, 2021 09 01.
Article in English | MEDLINE | ID: mdl-34471188

ABSTRACT

Autism arises in high- and low-risk families. De novo mutation contributes to autism incidence in low-risk families, as there is a higher incidence in the affected members of simplex families than in their unaffected siblings. But the extent of contribution in low-risk families cannot be determined solely from simplex families, as they are a mixture of low- and high-risk families. The rate of de novo mutation in nearly pure populations of high-risk families, the multiplex families, has not previously been rigorously determined. Moreover, rates of de novo mutation have been underestimated in studies based on low-resolution microarrays and whole exome sequencing. Here we report on findings from whole-genome sequencing (WGS) of both simplex families from the Simons Simplex Collection (SSC) and multiplex families from the Autism Genetic Resource Exchange (AGRE). After removing the multiplex samples with excessive cell-line genetic drift, we find that the contribution of de novo mutation in multiplex is significantly smaller than the contribution in simplex. We use WGS to provide high-resolution CNV profiles and to analyze more than just coding regions, and revise upward the rate in simplex autism due to an excess of de novo events targeting introns. Based on this study, we now estimate that de novo events contribute to 52-67% of cases of autism arising from low-risk families, and 30-39% of cases of all autism.


Subjects
Autistic Disorder/epidemiology , Genetic Predisposition to Disease/genetics , Mutation , Adult , Autism Spectrum Disorder , Autistic Disorder/genetics , Female , Humans , Incidence , Male , Middle Aged , New York/epidemiology , Risk Factors , Young Adult
9.
Nat Genet ; 53(8): 1125-1134, 2021 08.
Article in English | MEDLINE | ID: mdl-34312540

ABSTRACT

Autism is a highly heritable complex disorder in which de novo mutation (DNM) variation contributes significantly to risk. Using whole-genome sequencing data from 3,474 families, we investigate another source of large-effect risk variation, ultra-rare variants. We report and replicate a transmission disequilibrium of private, likely gene-disruptive (LGD) variants in probands but find that 95% of this burden resides outside of known DNM-enriched genes. This variant class more strongly affects multiplex family probands and supports a multi-hit model for autism. Candidate genes with private LGD variants preferentially transmitted to probands converge on the E3 ubiquitin-protein ligase complex, intracellular transport and Erb signaling protein networks. We estimate that these variants are approximately 2.5 generations old and significantly younger than other variants of similar type and frequency in siblings. Overall, private LGD variants are under strong purifying selection and appear to act on a distinct set of genes not yet associated with autism.


Subjects
Autism Spectrum Disorder/genetics , Genetic Predisposition to Disease , Proteins/genetics , Autistic Disorder/genetics , Molecular Evolution , Gene Dosage , Haplotypes , Humans , Linkage Disequilibrium , Genetic Models , Mutation , Pedigree , Single Nucleotide Polymorphism , Protein Interaction Maps/genetics , Siblings , Whole Genome Sequencing
10.
Genetics ; 215(3): 869-886, 2020 07.
Article in English | MEDLINE | ID: mdl-32327564

ABSTRACT

Baseline lung function, quantified as forced expiratory volume in the first second of exhalation (FEV1), is a standard diagnostic criterion used by clinicians to identify and classify lung diseases. Using whole-genome sequencing data from the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine project, we identified a novel genetic association with FEV1 on chromosome 12 in 867 African American children with asthma (P = 1.26 × 10^-8, β = 0.302). Conditional analysis within 1 Mb of the tag signal (rs73429450) yielded one major and two other weaker independent signals within this peak. We explored statistical and functional evidence for all variants in linkage disequilibrium with the three independent signals and identified nine variants as the most likely candidates responsible for the association with FEV1. Hi-C data and expression QTL analysis demonstrated that these variants physically interacted with KITLG (KIT ligand, also known as SCF), and their minor alleles were associated with increased expression of the KITLG gene in nasal epithelial cells. Gene-by-air-pollution interaction analysis found that the candidate variant rs58475486 interacted with past-year ambient sulfur dioxide exposure (P = 0.003, β = 0.32). This study identified a novel protective genetic association with FEV1, possibly mediated through KITLG, in African American children with asthma. This is the first study that has identified a genetic association between lung function and KITLG, which has established a role in orchestrating allergic inflammation in asthma.


Subjects
Air Pollution , Asthma/genetics , Forced Expiratory Volume , Gene-Environment Interaction , Single Nucleotide Polymorphism , Quantitative Trait Loci , Stem Cell Factor/genetics , Adolescent , Black or African American/genetics , Asthma/epidemiology , Asthma/physiopathology , Child , Human Chromosome Pair 12/genetics , Female , Humans , Linkage Disequilibrium , Male , Nasal Mucosa/metabolism , Stem Cell Factor/metabolism , Young Adult
11.
Genetics ; 212(3): 729-742, 2019 07.
Article in English | MEDLINE | ID: mdl-31123040

ABSTRACT

Condensins are evolutionarily conserved protein complexes that are required for chromosome segregation during cell division and genome organization during interphase. In Caenorhabditis elegans, a specialized condensin, which forms the core of the dosage compensation complex (DCC), binds to and represses X chromosome transcription. Here, we analyzed DCC localization and the effect of DCC depletion on histone modifications, transcription factor binding, and gene expression using chromatin immunoprecipitation sequencing and mRNA sequencing. Across the X, the DCC accumulates at accessible gene regulatory sites in active chromatin and not heterochromatin. The DCC is required for reducing the levels of activating histone modifications, including H3K4me3 and H3K27ac, but not repressive modification H3K9me3. In X-to-autosome fusion chromosomes, DCC spreading into the autosomal sequences locally reduces gene expression, thus establishing a direct link between DCC binding and repression. Together, our results indicate that DCC-mediated transcription repression is associated with a reduction in the activity of X chromosomal gene regulatory elements.


Subjects
Adenosine Triphosphatases/metabolism , Caenorhabditis elegans Proteins/metabolism , DNA-Binding Proteins/metabolism , Genetic Dosage Compensation , Histone Code , Multiprotein Complexes/metabolism , Nucleic Acid Regulatory Sequences , X Chromosome/genetics , Adenosine Triphosphatases/genetics , Animals , Caenorhabditis elegans , Caenorhabditis elegans Proteins/genetics , Chromatin/metabolism , DNA-Binding Proteins/genetics , Histones/genetics , Histones/metabolism , Multiprotein Complexes/genetics , Transcription Factors/metabolism , X Chromosome/metabolism
13.
Elife ; 6, 2017 05 30.
Article in English | MEDLINE | ID: mdl-28562241

ABSTRACT

In many organisms, it remains unclear how X chromosomes are specified for dosage compensation, since DNA sequence motifs shown to be important for dosage compensation complex (DCC) recruitment are themselves not X-specific. Here, we addressed this problem in C. elegans. We found that the DCC recruiter, SDC-2, is required to maintain open chromatin at a small number of primary DCC recruitment sites, whose sequence and genomic context are X-specific. Along the X, primary recruitment sites are interspersed with secondary sites, whose function is X-dependent. A secondary site can ectopically recruit the DCC when additional recruitment sites are inserted either in tandem or at a distance (>30 kb). Deletion of a recruitment site on the X results in reduced DCC binding across several megabases surrounded by topologically associating domain (TAD) boundaries. Our work elucidates that hierarchy and long-distance cooperativity between gene-regulatory elements target a single chromosome for regulation.


Subjects
Caenorhabditis elegans/genetics , Genetic Dosage Compensation , X Chromosome/metabolism , Animals , Chromatin/metabolism , Syndecan-2/metabolism
14.
Genome Biol ; 14(10): R112, 2013.
Article in English | MEDLINE | ID: mdl-24125077

ABSTRACT

BACKGROUND: Condensins are multi-subunit protein complexes that are essential for chromosome condensation during mitosis and meiosis, and play key roles in transcription regulation during interphase. Metazoans contain two condensins, I and II, which perform different functions and localize to different chromosomal regions. Caenorhabditis elegans contains a third condensin, I(DC), that is targeted to and represses transcription of the X chromosome for dosage compensation. RESULTS: To understand condensin binding and function, we performed ChIP-seq analysis of C. elegans condensins in mixed developmental stage embryos, which contain predominantly interphase nuclei. Condensins bind to a subset of active promoters, tRNA genes and putative enhancers. Expression analysis in kle-2-mutant larvae suggests that the primary effect of condensin II on transcription is repression. A DNA sequence motif, GCGC, is enriched at condensin II binding sites. A sequence extension of this core motif, AGGG, creates the condensin I(DC) motif. In addition to differences in recruitment that result in X-enrichment of condensin I(DC) and condensin II binding to all chromosomes, we provide evidence for a shared recruitment mechanism, as condensin I(DC) recruiter SDC-2 also recruits condensin II to the condensin I(DC) recruitment sites on the X. In addition, we found that condensin sites overlap extensively with the cohesin loader SCC-2, and that SDC-2 also recruits SCC-2 to the condensin I(DC) recruitment sites. CONCLUSIONS: Our results provide the first genome-wide view of metazoan condensin II binding in interphase, define putative recruitment motifs, and illustrate shared loading mechanisms for condensin I(DC) and condensin II.


Subjects
Adenosine Triphosphatases/genetics , Adenosine Triphosphatases/metabolism , Caenorhabditis elegans/genetics , Caenorhabditis elegans/metabolism , DNA-Binding Proteins/genetics , DNA-Binding Proteins/metabolism , Genome-Wide Association Study , Multiprotein Complexes/genetics , Multiprotein Complexes/metabolism , Animals , Base Sequence , Binding Sites , Chromatin Immunoprecipitation , Chromosomes/genetics , Chromosomes/metabolism , High-Throughput Nucleotide Sequencing , Male , Mutation , Nucleotide Motifs , Position-Specific Scoring Matrices , Genetic Promoter Regions , Protein Binding , Reproducibility of Results , Transcription Factors/metabolism , Genetic Transcription