Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 12 de 12
Filter
Add more filters










Publication year range
1.
Nat Genet ; 53(3): 313-321, 2021 03.
Article in English | MEDLINE | ID: mdl-33664507

ABSTRACT

Induced pluripotent stem cells (iPSCs) are an established cellular system to study the impact of genetic variants in derived cell types and developmental contexts. However, in their pluripotent state, the disease impact of genetic variants is less well known. Here, we integrate data from 1,367 human iPSC lines to comprehensively map common and rare regulatory variants in human pluripotent cells. Using this population-scale resource, we report hundreds of new colocalization events for human traits specific to iPSCs, and find increased power to identify rare regulatory variants compared with somatic tissues. Finally, we demonstrate how iPSCs enable the identification of causal genes for rare diseases.


Subject(s)
Genetic Variation , Induced Pluripotent Stem Cells/physiology , Quantitative Trait Loci , Bardet-Biedl Syndrome/genetics , Calcium Channels/genetics , Cell Line , Cerebellar Ataxia/genetics , DNA Methylation , Gene Expression , Humans , Induced Pluripotent Stem Cells/cytology , Polymorphism, Single Nucleotide , Proteins/genetics , Rare Diseases/genetics , Regulatory Sequences, Nucleic Acid , Sequence Analysis, RNA , Whole Genome Sequencing
2.
Nat Commun ; 11(1): 2927, 2020 06 10.
Article in English | MEDLINE | ID: mdl-32522982

ABSTRACT

Structural variants (SVs) and short tandem repeats (STRs) comprise a broad group of diverse DNA variants which vastly differ in their sizes and distributions across the genome. Here, we identify genomic features of SV classes and STRs that are associated with gene expression and complex traits, including their locations relative to eGenes, likelihood of being associated with multiple eGenes, associated eGene types (e.g., coding, noncoding, level of evolutionary constraint), effect sizes, linkage disequilibrium with tagging single nucleotide variants used in GWAS, and likelihood of being associated with GWAS traits. We identify a set of high-impact SVs/STRs associated with the expression of three or more eGenes via chromatin loops and show that they are highly enriched for being associated with GWAS traits. Our study provides insights into the genomic properties of structural variant classes and short tandem repeats that are associated with gene expression and human traits.


Subject(s)
Microsatellite Repeats/genetics , Cell Line , Genetic Variation/genetics , Genome-Wide Association Study , Humans , Linkage Disequilibrium/genetics , Multifactorial Inheritance , Polymorphism, Single Nucleotide/genetics , Quantitative Trait Loci/genetics
3.
Nat Commun ; 11(1): 2928, 2020 06 10.
Article in English | MEDLINE | ID: mdl-32522985

ABSTRACT

Structural variants (SVs) and short tandem repeats (STRs) are important sources of genetic diversity but are not routinely analyzed in genetic studies because they are difficult to accurately identify and genotype. Because SVs and STRs range in size and type, it is necessary to apply multiple algorithms that incorporate different types of evidence from sequencing data and employ complex filtering strategies to discover a comprehensive set of high-quality and reproducible variants. Here we assemble a set of 719 deep whole genome sequencing (WGS) samples (mean 42×) from 477 distinct individuals which we use to discover and genotype a wide spectrum of SV and STR variants using five algorithms. We use 177 unique pairs of genetic replicates to identify factors that affect variant call reproducibility and develop a systematic filtering strategy to create of one of the most complete and well characterized maps of SVs and STRs to date.


Subject(s)
Microsatellite Repeats/genetics , Whole Genome Sequencing/methods , Algorithms , Computational Biology , Genotype , Haplotypes/genetics , High-Throughput Nucleotide Sequencing , Humans
4.
Elife ; 82019 11 20.
Article in English | MEDLINE | ID: mdl-31746734

ABSTRACT

The MHC region is highly associated with autoimmune and infectious diseases. Here we conduct an in-depth interrogation of associations between genetic variation, gene expression and disease. We create a comprehensive map of regulatory variation in the MHC region using WGS from 419 individuals to call eight-digit HLA types and RNA-seq data from matched iPSCs. Building on this regulatory map, we explored GWAS signals for 4083 traits, detecting colocalization for 180 disease loci with eQTLs. We show that eQTL analyses taking HLA type haplotypes into account have substantially greater power compared with only using single variants. We examined the association between the 8.1 ancestral haplotype and delayed colonization in Cystic Fibrosis, postulating that downregulation of RNF5 expression is the likely causal mechanism. Our study provides insights into the genetic architecture of the MHC region and pinpoints disease associations that are due to differential expression of HLA genes and non-HLA genes.


Subject(s)
Cystic Fibrosis/genetics , Genetic Predisposition to Disease , Major Histocompatibility Complex/genetics , Quantitative Trait Loci/genetics , Adolescent , Adult , Aged , Aged, 80 and over , Alleles , Chromosome Mapping , Cystic Fibrosis/pathology , Female , Genome-Wide Association Study , HLA Antigens/genetics , Haplotypes , Humans , Male , Middle Aged , Polymorphism, Single Nucleotide/genetics , RNA-Seq , Young Adult
5.
Nat Commun ; 10(1): 1054, 2019 03 05.
Article in English | MEDLINE | ID: mdl-30837461

ABSTRACT

While genetic variation at chromatin loops is relevant for human disease, the relationships between contact propensity (the probability that loci at loops physically interact), genetics, and gene regulation are unclear. We quantitatively interrogate these relationships by comparing Hi-C and molecular phenotype data across cell types and haplotypes. While chromatin loops consistently form across different cell types, they have subtle quantitative differences in contact frequency that are associated with larger changes in gene expression and H3K27ac. For the vast majority of loci with quantitative differences in contact frequency across haplotypes, the changes in magnitude are smaller than those across cell types; however, the proportional relationships between contact propensity, gene expression, and H3K27ac are consistent. These findings suggest that subtle changes in contact propensity have a biologically meaningful role in gene regulation and could be a mechanism by which regulatory genetic variants in loop anchors mediate effects on expression.


Subject(s)
Chromatin/genetics , DNA/genetics , Gene Expression Regulation , Histones/genetics , Quantitative Trait Loci/genetics , Adolescent , Adult , Aged , Cell Line , Chromatin/metabolism , DNA/metabolism , Female , Histones/metabolism , Humans , Induced Pluripotent Stem Cells , Male , Middle Aged , Myocytes, Cardiac , Nucleic Acid Conformation , Polymorphism, Single Nucleotide , Whole Genome Sequencing , Young Adult
6.
Cell Rep ; 24(4): 883-894, 2018 07 24.
Article in English | MEDLINE | ID: mdl-30044985

ABSTRACT

To understand the mutational burden of human induced pluripotent stem cells (iPSCs), we sequenced genomes of 18 fibroblast-derived iPSC lines and identified different classes of somatic mutations based on structure, origin, and frequency. Copy-number alterations affected 295 kb in each sample and strongly impacted gene expression. UV-damage mutations were present in ∼45% of the iPSCs and accounted for most of the observed heterogeneity in mutation rates across lines. Subclonal mutations (not present in all iPSCs within a line) composed 10% of point mutations and, compared with clonal variants, showed an enrichment in active promoters and increased association with altered gene expression. Our study shows that, by combining WGS, transcriptome, and epigenome data, we can understand the mutational burden of each iPSC line on an individual basis and suggests that this information could be used to prioritize iPSC lines for models of specific human diseases and/or transplantation therapy.


Subject(s)
Induced Pluripotent Stem Cells/cytology , Induced Pluripotent Stem Cells/physiology , Cell Differentiation/physiology , Cells, Cultured , Cellular Reprogramming/genetics , Humans , Mutation , Mutation Rate
8.
Genes (Basel) ; 8(9)2017 08 24.
Article in English | MEDLINE | ID: mdl-28837078

ABSTRACT

Retinitis pigmentosa (RP) causes progressive photoreceptor loss resulting from mutations in over 80 genes. This study identified the genetic cause of RP in three members of a non-consanguineous pedigree. Detailed ophthalmic evaluation was performed in the three affected family members. Whole exome sequencing (WES) and whole genome sequencing (WGS) were performed in the three affected and the two unaffected family members and variants were filtered to detect rare, potentially deleterious variants segregating with disease. WES and WGS did not identify potentially pathogenic variants shared by all three affected members. However, WES identified a previously reported homozygous nonsense mutation in KIZ (c.226C>T, p.Arg76*) in two affected sisters, but not in their affected second cousin. WGS revealed a novel 1.135 kb homozygous deletion in a retina transcript of C21orf2 and a novel 30.651 kb heterozygous deletion in CACNA2D4 in the affected second cousin. The sisters with the KIZ mutation carried no copies of the C21orf2 or CACNA2D4 deletions, while the second cousin with the C21orf2 and CACNA2D4 deletions carried no copies of the KIZ mutation. This study identified two independent, homozygous mutations in genes previously reported in autosomal recessive RP in a non-consanguineous family, and demonstrated the value of WGS when WES fails to identify likely disease-causing mutations.

9.
Cell Stem Cell ; 20(4): 533-546.e7, 2017 04 06.
Article in English | MEDLINE | ID: mdl-28388430

ABSTRACT

In this study, we used whole-genome sequencing and gene expression profiling of 215 human induced pluripotent stem cell (iPSC) lines from different donors to identify genetic variants associated with RNA expression for 5,746 genes. We were able to predict causal variants for these expression quantitative trait loci (eQTLs) that disrupt transcription factor binding and validated a subset of them experimentally. We also identified copy-number variant (CNV) eQTLs, including some that appear to affect gene expression by altering the copy number of intergenic regulatory regions. In addition, we were able to identify effects on gene expression of rare genic CNVs and regulatory single-nucleotide variants and found that reactivation of gene expression on the X chromosome depends on gene chromosomal position. Our work highlights the value of iPSCs for genetic association analyses and provides a unique resource for investigating the genetic regulation of gene expression in pluripotent cells.


Subject(s)
Gene Expression Profiling/methods , Gene Expression Regulation , Genetic Variation , Induced Pluripotent Stem Cells/metabolism , Binding Sites/genetics , Cellular Reprogramming/genetics , Chromosomes, Human, X/genetics , DNA Copy Number Variations/genetics , Genetic Heterogeneity , Humans , Molecular Sequence Annotation , Quantitative Trait Loci/genetics , Regulatory Sequences, Nucleic Acid/genetics , Transcription Factors/metabolism
10.
Stem Cell Reports ; 8(4): 1086-1100, 2017 04 11.
Article in English | MEDLINE | ID: mdl-28410642

ABSTRACT

Large-scale collections of induced pluripotent stem cells (iPSCs) could serve as powerful model systems for examining how genetic variation affects biology and disease. Here we describe the iPSCORE resource: a collection of systematically derived and characterized iPSC lines from 222 ethnically diverse individuals that allows for both familial and association-based genetic studies. iPSCORE lines are pluripotent with high genomic integrity (no or low numbers of somatic copy-number variants) as determined using high-throughput RNA-sequencing and genotyping arrays, respectively. Using iPSCs from a family of individuals, we show that iPSC-derived cardiomyocytes demonstrate gene expression patterns that cluster by genetic background, and can be used to examine variants associated with physiological and disease phenotypes. The iPSCORE collection contains representative individuals for risk and non-risk alleles for 95% of SNPs associated with human phenotypes through genome-wide association studies. Our study demonstrates the utility of iPSCORE for examining how genetic variants influence molecular and physiological traits in iPSCs and derived cell lines.


Subject(s)
Arrhythmias, Cardiac/genetics , Databases, Factual , Genetic Association Studies , Genetic Variation , Induced Pluripotent Stem Cells/metabolism , Myocytes, Cardiac/metabolism , Arrhythmias, Cardiac/ethnology , Arrhythmias, Cardiac/metabolism , Arrhythmias, Cardiac/physiopathology , Cell Differentiation , Cell Line , Cellular Reprogramming/genetics , Genotype , High-Throughput Nucleotide Sequencing , Humans , Induced Pluripotent Stem Cells/cytology , Multigene Family , Myocytes, Cardiac/cytology , Oligonucleotide Array Sequence Analysis , Phenotype , Polymorphism, Single Nucleotide , Racial Groups
11.
Physiol Genomics ; 48(12): 922-927, 2016 12 01.
Article in English | MEDLINE | ID: mdl-27764769

ABSTRACT

While more than 250 genes are known to cause inherited retinal degenerations (IRD), nearly 40-50% of families have the genetic basis for their disease unknown. In this study we sought to identify the underlying cause of IRD in a family by whole genome sequence (WGS) analysis. Clinical characterization including standard ophthalmic examination, fundus photography, visual field testing, electroretinography, and review of medical and family history was performed. WGS was performed on affected and unaffected family members using Illumina HiSeq X10. Sequence reads were aligned to hg19 using BWA-MEM and variant calling was performed with Genome Analysis Toolkit. The called variants were annotated with SnpEff v4.11, PolyPhen v2.2.2, and CADD v1.3. Copy number variations were called using Genome STRiP (svtoolkit 2.00.1611) and SpeedSeq software. Variants were filtered to detect rare potentially deleterious variants segregating with disease. Candidate variants were validated by dideoxy sequencing. Clinical evaluation revealed typical adolescent-onset recessive retinitis pigmentosa (arRP) in affected members. WGS identified about 4 million variants in each individual. Two rare and potentially deleterious compound heterozygous variants p.Arg281Cys and p.Arg487* were identified in the gene ATP/GTP binding protein like 5 (AGBL5) as likely causal variants. No additional variants in IRD genes that segregated with disease were identified. Mutation analysis confirmed the segregation of these variants with the IRD in the pedigree. Homology models indicated destabilization of AGBL5 due to the p.Arg281Cys change. Our findings establish the involvement of mutations in AGBL5 in RP and validate the WGS variant filtering pipeline we designed.


Subject(s)
Carboxypeptidases/genetics , Retinitis Pigmentosa/genetics , Adolescent , DNA Mutational Analysis , Electroretinography/methods , Female , Genetic Association Studies/methods , Humans , Male , Mutation/genetics , Pedigree , Retinal Degeneration/genetics , Whole Genome Sequencing/methods , Young Adult
12.
Nat Med ; 18(10): 1503-10, 2012 Oct.
Article in English | MEDLINE | ID: mdl-22983396

ABSTRACT

The discovery of potent inhibitors of the BRAF proto-oncogene has revolutionized therapy for melanoma harboring mutations in BRAF, yet NRAS-mutant melanoma remains without an effective therapy. Because direct pharmacological inhibition of the RAS proto-oncogene has thus far been unsuccessful, we explored systems biology approaches to identify synergistic drug combination(s) that can mimic RAS inhibition. Here, leveraging an inducible mouse model of NRAS-mutant melanoma, we show that pharmacological inhibition of mitogen-activated protein kinase kinase (MEK) activates apoptosis but not cell-cycle arrest, which is in contrast to complete genetic neuroblastoma RAS homolog (NRAS) extinction, which triggers both of these effects. Network modeling pinpointed cyclin-dependent kinase 4 (CDK4) as a key driver of this differential phenotype. Accordingly, combined pharmacological inhibition of MEK and CDK4 in vivo led to substantial synergy in therapeutic efficacy. We suggest a gradient model of oncogenic NRAS signaling in which the output is gated, resulting in the decoupling of discrete downstream biological phenotypes as a result of incomplete inhibition. Such a gated signaling model offers a new framework to identify nonobvious coextinction target(s) for combined pharmacological inhibition in NRAS-mutant melanomas.


Subject(s)
Apoptosis/drug effects , Cyclin-Dependent Kinase 4/antagonists & inhibitors , Melanoma/metabolism , Mitogen-Activated Protein Kinase Kinases/antagonists & inhibitors , Proto-Oncogene Proteins p21(ras)/antagonists & inhibitors , Proto-Oncogene Proteins p21(ras)/metabolism , Animals , Benzimidazoles/pharmacology , Cell Cycle Checkpoints/drug effects , Cell Line, Tumor , Cell Proliferation/drug effects , Cell Survival , Cyclin-Dependent Kinase 4/metabolism , Doxycycline/pharmacology , Female , Genes, ras , Humans , Melanoma/drug therapy , Melanoma/genetics , Melanoma/pathology , Mice , Mice, Knockout , Mice, Nude , Middle Aged , Mitogen-Activated Protein Kinase Kinases/metabolism , Proto-Oncogene Mas , Proto-Oncogene Proteins p21(ras)/genetics , Signal Transduction/drug effects
SELECTION OF CITATIONS
SEARCH DETAIL
...