Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Cell ; 176(3): 663-675.e19, 2019 01 24.
Artículo en Inglés | MEDLINE | ID: mdl-30661756

RESUMEN

In order to provide a comprehensive resource for human structural variants (SVs), we generated long-read sequence data and analyzed SVs for fifteen human genomes. We sequence resolved 99,604 insertions, deletions, and inversions including 2,238 (1.6 Mbp) that are shared among all discovery genomes with an additional 13,053 (6.9 Mbp) present in the majority, indicating minor alleles or errors in the reference. Genotyping in 440 additional genomes confirms the most common SVs in unique euchromatin are now sequence resolved. We report a ninefold SV bias toward the last 5 Mbp of human chromosomes with nearly 55% of all VNTRs (variable number of tandem repeats) mapping to this portion of the genome. We identify SVs affecting coding and noncoding regulatory loci improving annotation and interpretation of functional variation. These data provide the framework to construct a canonical human reference and a resource for developing advanced representations capable of capturing allelic diversity.


Asunto(s)
Frecuencia de los Genes/genética , Genoma Humano/genética , Variación Estructural del Genoma/genética , Alelos , Eucromatina/genética , Genómica/métodos , Humanos , Repeticiones de Minisatélite/genética , Análisis de Secuencia de ADN/métodos
2.
Cell ; 172(5): 897-909.e21, 2018 02 22.
Artículo en Inglés | MEDLINE | ID: mdl-29474918

RESUMEN

X-linked Dystonia-Parkinsonism (XDP) is a Mendelian neurodegenerative disease that is endemic to the Philippines and is associated with a founder haplotype. We integrated multiple genome and transcriptome assembly technologies to narrow the causal mutation to the TAF1 locus, which included a SINE-VNTR-Alu (SVA) retrotransposition into intron 32 of the gene. Transcriptome analyses identified decreased expression of the canonical cTAF1 transcript among XDP probands, and de novo assembly across multiple pluripotent stem-cell-derived neuronal lineages discovered aberrant TAF1 transcription that involved alternative splicing and intron retention (IR) in proximity to the SVA that was anti-correlated with overall TAF1 expression. CRISPR/Cas9 excision of the SVA rescued this XDP-specific transcriptional signature and normalized TAF1 expression in probands. These data suggest an SVA-mediated aberrant transcriptional mechanism associated with XDP and may provide a roadmap for layered technologies and integrated assembly-based analyses for other unsolved Mendelian disorders.


Asunto(s)
Trastornos Distónicos/genética , Enfermedades Genéticas Ligadas al Cromosoma X/genética , Genoma Humano , Transcriptoma/genética , Empalme Alternativo/genética , Elementos Alu/genética , Secuencia de Bases , Sistemas CRISPR-Cas/genética , Estudios de Cohortes , Familia , Femenino , Sitios Genéticos , Haplotipos/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Histona Acetiltransferasas/genética , Histona Acetiltransferasas/metabolismo , Humanos , Células Madre Pluripotentes Inducidas/metabolismo , Intrones/genética , Masculino , Repeticiones de Minisatélite/genética , Modelos Genéticos , Degeneración Nerviosa/genética , Degeneración Nerviosa/patología , Células-Madre Neurales/metabolismo , Neuronas/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismo , Elementos de Nucleótido Esparcido Corto , Factores Asociados con la Proteína de Unión a TATA/genética , Factores Asociados con la Proteína de Unión a TATA/metabolismo , Factor de Transcripción TFIID/genética , Factor de Transcripción TFIID/metabolismo
3.
PLoS Genet ; 15(3): e1008075, 2019 03.
Artículo en Inglés | MEDLINE | ID: mdl-30917130

RESUMEN

Human chromosome 15q25 is involved in several disease-associated structural rearrangements, including microdeletions and chromosomal markers with inverted duplications. Using comparative fluorescence in situ hybridization, strand-sequencing, single-molecule, real-time sequencing and Bionano optical mapping analyses, we investigated the organization of the 15q25 region in human and nonhuman primates. We found that two independent inversions occurred in this region after the fission event that gave rise to phylogenetic chromosomes XIV and XV in humans and great apes. One of these inversions is still polymorphic in the human population today and may confer differential susceptibility to 15q25 microdeletions and inverted duplications. The inversion breakpoints map within segmental duplications containing core duplicons of the GOLGA gene family and correspond to the site of an ancestral centromere, which became inactivated about 25 million years ago. The inactivation of this centromere likely released segmental duplications from recombination repression typical of centromeric regions. We hypothesize that this increased the frequency of ectopic recombination creating a hotspot of hominid inversions where dispersed GOLGA core elements now predispose this region to recurrent genomic rearrangements associated with disease.


Asunto(s)
Inversión Cromosómica , Cromosomas Humanos Par 15/genética , Duplicaciones Segmentarias en el Genoma , Animales , Autoantígenos/genética , Inestabilidad Cromosómica , Evolución Molecular , Dosificación de Gen , Reordenamiento Génico , Variación Genética , Proteínas de la Matriz de Golgi/genética , Hominidae/genética , Humanos , Familia de Multigenes , Filogenia , Primates/genética , Recombinación Genética , Especificidad de la Especie
4.
Proc Natl Acad Sci U S A ; 115(19): E4433-E4442, 2018 05 08.
Artículo en Inglés | MEDLINE | ID: mdl-29686068

RESUMEN

Structural variation and single-nucleotide variation of the complement factor H (CFH) gene family underlie several complex genetic diseases, including age-related macular degeneration (AMD) and atypical hemolytic uremic syndrome (AHUS). To understand its diversity and evolution, we performed high-quality sequencing of this ∼360-kbp locus in six primate lineages, including multiple human haplotypes. Comparative sequence analyses reveal two distinct periods of gene duplication leading to the emergence of four CFH-related (CFHR) gene paralogs (CFHR2 and CFHR4 ∼25-35 Mya and CFHR1 and CFHR3 ∼7-13 Mya). Remarkably, all evolutionary breakpoints share a common ∼4.8-kbp segment corresponding to an ancestral CFHR gene promoter that has expanded independently throughout primate evolution. This segment is recurrently reused and juxtaposed with a donor duplication containing exons 8 and 9 from ancestral CFH, creating four CFHR fusion genes that include lineage-specific members of the gene family. Combined analysis of >5,000 AMD cases and controls identifies a significant burden of a rare missense mutation that clusters at the N terminus of CFH [P = 5.81 × 10-8, odds ratio (OR) = 9.8 (3.67-Infinity)]. A bipolar clustering pattern of rare nonsynonymous mutations in patients with AMD (P < 10-3) and AHUS (P = 0.0079) maps to functional domains that show evidence of positive selection during primate evolution. Our structural variation analysis in >2,400 individuals reveals five recurrent rearrangement breakpoints that show variable frequency among AMD cases and controls. These data suggest a dynamic and recurrent pattern of mutation critical to the emergence of new CFHR genes but also in the predisposition to complex human genetic disease phenotypes.


Asunto(s)
Evolución Molecular , Degeneración Macular/genética , Degeneración Macular/patología , Mutación , Polimorfismo de Nucleótido Simple , Selección Genética , Animales , Factor H de Complemento/genética , Exones , Predisposición Genética a la Enfermedad , Genotipo , Haplotipos , Humanos , Familia de Multigenes , Fenotipo , Primates , Factores de Riesgo
5.
Genome Res ; 26(11): 1453-1467, 2016 11.
Artículo en Inglés | MEDLINE | ID: mdl-27803192

RESUMEN

Recurrent rearrangements of Chromosome 8p23.1 are associated with congenital heart defects and developmental delay. The complexity of this region has led to inconsistencies in the current reference assembly, confounding studies of genetic variation. Using comparative sequence-based approaches, we generated a high-quality 6.3-Mbp alternate reference assembly of an inverted Chromosome 8p23.1 haplotype. Comparison with nonhuman primates reveals a 746-kbp duplicative transposition and two separate inversion events that arose in the last million years of human evolution. The breakpoints associated with these rearrangements map to an ape-specific interchromosomal core duplicon that clusters at sites of evolutionary inversion (P = 7.8 × 10-5). Refinement of microdeletion breakpoints identifies a subgroup of patients that map to the same interchromosomal core involved in the evolutionary formation of the duplication blocks. Our results define a higher-order genomic instability element that has shaped the structure of specific chromosomes during primate evolution contributing to rearrangements associated with inversion and disease.


Asunto(s)
Evolución Molecular , Predisposición Genética a la Enfermedad , Inestabilidad Genómica , Duplicaciones Segmentarias en el Genoma , Animales , Puntos de Rotura del Cromosoma , Deleción Cromosómica , Cromosomas Humanos Par 8/genética , Humanos , Primates/genética
6.
J Neuroinflammation ; 13(1): 81, 2016 Apr 18.
Artículo en Inglés | MEDLINE | ID: mdl-27090374

RESUMEN

BACKGROUND: Age-related macular degeneration (AMD) is the leading cause of vision loss in Western societies with a strong genetic component. Candidate gene studies as well as genome-wide association studies strongly implicated genetic variations in complement genes to be involved in disease risk. So far, no association of AMD with complement component 4 (C4) was reported probably due to the complex nature of the C4 locus on chromosome 6. METHODS: We used multiplex ligation-dependent probe amplification (MLPA) to determine the copy number of the C4 gene as well as of both relevant isoforms, C4A and C4B, and assessed their association with AMD using logistic regression models. RESULTS: Here, we report on the analysis of 2645 individuals (1536 probands and 1109 unaffected controls), across three different centers, for multiallelic copy number variation (CNV) at the C4 locus. We find strong statistical significance for association of increased copy number of C4A (OR 0.81 (0.73; 0.89);P = 4.4 × 10(-5)), with the effect most pronounced in individuals over 78 years (OR 0.67 (0.55; 0.81)) and females (OR 0.77 (0.68; 0.87)). Furthermore, this association is independent of known AMD-associated risk variants in the nearby CFB/C2 locus, particularly in females and in individuals over 78 years. CONCLUSIONS: Our data strengthen the notion that complement dysregulation plays a crucial role in AMD etiology, an important finding for early intervention strategies and future therapeutics. In addition, for the first time, we provide evidence that multiallelic CNVs are associated with AMD pathology.


Asunto(s)
Complemento C4a/genética , Dosificación de Gen , Predisposición Genética a la Enfermedad/genética , Degeneración Macular/genética , Alelos , Estudios de Cohortes , Femenino , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Masculino , Reacción en Cadena de la Polimerasa Multiplex
7.
BMC Genomics ; 15: 329, 2014 May 01.
Artículo en Inglés | MEDLINE | ID: mdl-24885186

RESUMEN

BACKGROUND: Intrachromosomal segmental duplications provide the substrate for non-allelic homologous recombination, facilitating extensive copy number variation in the human genome. Many multi-copy gene families are embedded within genomic regions with high levels of sequence identity (>95%) and therefore pose considerable analytical challenges. In some cases, the complexity involved in analyzing such regions is largely underestimated. Rapid, cost effective analysis of multi-copy gene regions have typically implemented quantitative approaches, however quantitative data are not an absolute means of certainty. Therefore any technique prone to degrees of measurement error can produce ambiguous results that may lead to spurious associations with complex disease. RESULTS: In this study we have focused on testing the accuracy and reproducibility of quantitative analysis techniques. With reference to the C-C Chemokine Ligand-3-like-1 (CCL3L1) gene, we performed analysis using real-time Quantitative PCR (QPCR), Multiplex Ligation-dependent Probe Amplification (MLPA) and Paralogue Ratio Test (PRT). After controlling for potential outside variables on assay performance, including DNA concentration, quality, preparation and storage conditions, we find that real-time QPCR produces data that does not cluster tightly around copy number integer values, with variation substantially greater than that of the MLPA or PRT systems. We find that the method of rounding real-time QPCR measurements can potentially lead to mis-scoring of copy number genotypes and suggest caution should be exercised in interpreting QPCR data. CONCLUSIONS: We conclude that real-time QPCR is inherently prone to measurement error, even under conditions that would seem favorable for association studies. Our results indicate that potential variability in the physicochemical properties of the DNA samples cannot solely explain the poor performance exhibited by the real-time QPCR systems. We recommend that more robust approaches such as PRT or MLPA should be used to genotype multi-allelic copy number variation in disease association studies and suggest several approaches which can be implemented to ensure the quality of the copy number typing using quantitative methods.


Asunto(s)
Alelos , Variaciones en el Número de Copia de ADN , Genotipo , Duplicaciones Segmentarias en el Genoma , Secuencia de Bases , Cartilla de ADN , Reacción en Cadena en Tiempo Real de la Polimerasa
8.
Hum Mol Genet ; 21(23): 5229-36, 2012 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-22936692

RESUMEN

Genetic variation in the gene encoding complement factor H (CFH) on chromosome 1q31 has repeatedly been associated with an increased risk of age-related macular degeneration (AMD); however, previous studies have had inadequate numbers of participants across a sufficiently wide age range to determine whether the association varies by age. We conducted a genetic case-control study using data from 2294 cases and 2294 controls selected from the Melbourne Collaborative Cohort Study, matched on age, sex and region of origin. Four consistently replicated CFH single-nucleotide polymorphisms (SNPs) were genotyped: rs1061170 (Y402H), rs2274700, rs393955 and rs800292; their relationship with AMD prevalence was determined across the age range 48-86. A difference in genotype frequencies was seen across age groups, where the low-risk homozygote prevalence rose with each increasing age group. Associations with early AMD were strongly modified by age for three of the four SNPs (interaction P-value: 0.01-0.00003). An inverse association between the high-risk homozygote for each SNP and early AMD was observed in the younger age groups [odds ratios (OR) range 0.37-0.48 for age <55], reversing to a positive association with increasing age (OR 1.87-2.8 for age >75). The direction of associations for this gene change was from inverse to risk with increasing age. These findings have important implications for predictive models for AMD and potentially other age-related diseases which extrapolate risks from older cohorts, as they assume homogeneity of association by age, which might not exist.


Asunto(s)
Factor H de Complemento/genética , Degeneración Macular/genética , Factores de Edad , Anciano , Anciano de 80 o más Años , Alelos , Estudios de Casos y Controles , Femenino , Frecuencia de los Genes , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Oportunidad Relativa
9.
Genomics ; 101(2): 86-93, 2013 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-23123317

RESUMEN

Genome structural variation shows remarkable complexity with respect to copy number, sequence content and distribution. While the discovery of copy number polymorphisms (CNP) has increased exponentially in recent years, the transition from discovery to genotyping has proved challenging, particularly for CNPs embedded in complex regions of the genome. CNPs that are collectively common in the population and possess a dynamic range of copy numbers have proved the most difficult to genotype in association studies. This is in some part due to technical limitations of genotyping assays and the sequence properties of the genomic region being analyzed. Here we describe in detail the basis of a number of molecular techniques used to genotype complex CNPs, compare and contrast these approaches for determination of multi-allelic copy number, and discuss the potential application of these techniques in genetic studies.


Asunto(s)
Variaciones en el Número de Copia de ADN , Técnicas de Genotipaje , Southern Blotting , Hibridación Genómica Comparativa , Electroforesis en Gel de Campo Pulsado , Hibridación Fluorescente in Situ , Hibridación de Ácido Nucleico , Análisis de Secuencia por Matrices de Oligonucleótidos , Reacción en Cadena de la Polimerasa
10.
Hum Mutat ; 34(1): 1-13, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22837109

RESUMEN

The human genome contains a significant amount of sequence variation, from single nucleotide polymorphisms to large stretches of DNA that may be present in a range of different copies between individuals. Several such regions are variable in >1% of the population (referred to as copy number polymorphisms or CNPs), and many studies have looked for associations between the copy number of genes within multiallelic CNPs and disease susceptibility. Associations have indeed been described for several genes, including the ß-defensins (DEFB4, DEFB103, DEFB104), chemokine ligand 3 like 1 (CCL3L1), Fc gamma receptor 3B (FCGR3B), and complement component C4 (C4). However, follow-up replication in independent cohorts has failed to reproduce a number of these associations. It is clear that replicated associations such as those between C4 and systemic lupus erythematosus, and ß-defensin and psoriasis, have used robust genotyping methodologies. Technical issues associated with genotyping sequences of high identity may therefore account for failure to replicate other associations. Here, we compare and contrast the most popular approaches that have been used to genotype CNPs, describe how they have been applied in different situations, and discuss potential reasons for the difficulty in reproducibly linking multiallelic CNPs to complex diseases.


Asunto(s)
Quimiocina CCL3/genética , Complemento C4/genética , Variaciones en el Número de Copia de ADN , Predisposición Genética a la Enfermedad/genética , Receptores de IgG/genética , beta-Defensinas/genética , Proteínas Ligadas a GPI/genética , Técnicas de Genotipaje/métodos , Humanos
11.
Mol Vis ; 19: 822-8, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23592919

RESUMEN

PURPOSE: The ARMS2/HTRA1 genes at the 10q26 locus have been associated with risk of age-related macular degeneration (AMD), with the most significantly associated variants being A69S (rs10490924), del443ins54 (EU427539) and rs11200638. We wished to explore the association of the del443ins54 in two ethnically different populations from India and Australia. METHODS: The del443ins54 was screened in a large cohort of ~1500 subjects from these two populations by a combination of PCR-based agarose gel electrophoresis and validated by resequencing. Statistical analysis comprised the calculations of allele, genotype and haplotype frequencies along with their p values and corresponding odds ratios (OR), and 95% confidence intervals (95% CI) and measures of linkage disequilibrium (LD). RESULTS: The del443ins54 was significantly associated with AMD in both the Indian (p=1.74 × 10(-13); OR = 2.80, 95%CI, 2.12-3.70) and Australian cohorts (p = 2.78 × 10(-30); OR = 3.15, 95%CI, 2.58-3.86). These associations were similar to those previously identified for the A69S and the rs11200638 variant in these populations that also exhibited high degrees of LD (D' of 0.87-0.99). A major risk haplotype of "T-indel-A" (p = 5.7 × 10(-16); OR = 3.16, 95%CI, 2.34-4.19 and p=6.33 × 10(-30); OR = 3.15, 95%CI, 2.57-3.85) and a protective haplotype of "G-wild type-G" (p=2.35 × 10(-11); OR = 0.39, 95%CI, 0.29-0.52 and p=1.02 × 10(-30); OR = 0.31, 95%CI, 0.25-0.38) were identified in the Indian and Australian cohorts, respectively. CONCLUSIONS: These data provide an independent replication of the association of del443ins54 variant in two different ethnicities, despite differences in allele and haplotype frequencies between them. High levels of LD in both populations limit further genetic dissection of this region in AMD.


Asunto(s)
Estudios de Asociación Genética , Sitios Genéticos/genética , Predisposición Genética a la Enfermedad , Mutación INDEL/genética , Degeneración Macular/genética , Proteínas/genética , Australia , Estudios de Cohortes , Frecuencia de los Genes/genética , Haplotipos/genética , Serina Peptidasa A1 que Requiere Temperaturas Altas , Humanos , India , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genética , Factores de Riesgo , Serina Endopeptidasas/genética
12.
Genomics ; 98(6): 412-21, 2011 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-21855625

RESUMEN

Age-related macular degeneration (AMD) is the leading cause of blindness in developed countries. It has been proposed that the polymorphism encoding Y402H (T1277C) in the complement factor H gene (CFH) is one of the main determinants of disease. We genotyped the polymorphism at a number of loci in the region encompassing the Regulators of Complement Activation (RCA) on chromosome 1, including T1277C SNP, in 187 patients and 146 controls. Haplotypes have been classified as protective (P) or susceptible (S) with respect to AMD. This included the identification of an S haplotype with a T at 1277. The results show that no single locus should be assumed to be directly responsible for AMD, but rather argue for the existence of RCA haplotypes, which can be assigned meaningful predictive values for AMD. We conclude that the critical sequences are within a region 450 kb centromeric to 128 kb telomeric of CFH.


Asunto(s)
Activación de Complemento/genética , Sitios Genéticos , Predisposición Genética a la Enfermedad , Haplotipos , Degeneración Macular/genética , Anciano , Anciano de 80 o más Años , Estudios de Casos y Controles , Cromosomas Humanos Par 1 , Factor H de Complemento/genética , Femenino , Frecuencia de los Genes , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple
13.
Nat Commun ; 12(1): 5118, 2021 08 25.
Artículo en Inglés | MEDLINE | ID: mdl-34433829

RESUMEN

TRP channel-associated factor 1/2 (TCAF1/TCAF2) proteins antagonistically regulate the cold-sensor protein TRPM8 in multiple human tissues. Understanding their significance has been complicated given the locus spans a gap-ridden region with complex segmental duplications in GRCh38. Using long-read sequencing, we sequence-resolve the locus, annotate full-length TCAF models in primate genomes, and show substantial human-specific TCAF copy number variation. We identify two human super haplogroups, H4 and H5, and establish that TCAF duplications originated ~1.7 million years ago but diversified only in Homo sapiens by recurrent structural mutations. Conversely, in all archaic-hominin samples the fixation for a specific H4 haplotype without duplication is likely due to positive selection. Here, our results of TCAF copy number expansion, selection signals in hominins, and differential TCAF2 expression between haplogroups and high TCAF2 and TRPM8 expression in liver and prostate in modern-day humans imply TCAF diversification among hominins potentially in response to cold or dietary adaptations.


Asunto(s)
Duplicación de Gen , Hominidae/genética , Proteínas de la Membrana/genética , Selección Genética , Animales , Variaciones en el Número de Copia de ADN , Evolución Molecular , Genoma Humano , Haplotipos , Humanos , Hombre de Neandertal , Filogenia
14.
Nat Genet ; 52(8): 849-858, 2020 08.
Artículo en Inglés | MEDLINE | ID: mdl-32541924

RESUMEN

Inversions play an important role in disease and evolution but are difficult to characterize because their breakpoints map to large repeats. We increased by sixfold the number (n = 1,069) of previously reported great ape inversions by using single-cell DNA template strand and long-read sequencing. We find that the X chromosome is most enriched (2.5-fold) for inversions, on the basis of its size and duplication content. There is an excess of differentially expressed primate genes near the breakpoints of large (>100 kilobases (kb)) inversions but not smaller events. We show that when great ape lineage-specific duplications emerge, they preferentially (approximately 75%) occur in an inverted orientation compared to that at their ancestral locus. We construct megabase-pair scale haplotypes for individual chromosomes and identify 23 genomic regions that have recurrently toggled between a direct and an inverted state over 15 million years. The direct orientation is most frequently the derived state for human polymorphisms that predispose to recurrent copy number variants associated with neurodevelopmental disease.


Asunto(s)
Inversión Cromosómica/genética , Genoma/genética , Hominidae/genética , Animales , Cromosomas/genética , Variaciones en el Número de Copia de ADN/genética , Evolución Molecular , Femenino , Haplotipos/genética , Humanos , Masculino
15.
Genome Biol ; 21(1): 202, 2020 08 10.
Artículo en Inglés | MEDLINE | ID: mdl-32778141

RESUMEN

BACKGROUND: The complex interspersed pattern of segmental duplications in humans is responsible for rearrangements associated with neurodevelopmental disease, including the emergence of novel genes important in human brain evolution. We investigate the evolution of LCR16a, a putative driver of this phenomenon that encodes one of the most rapidly evolving human-ape gene families, nuclear pore interacting protein (NPIP). RESULTS: Comparative analysis shows that LCR16a has independently expanded in five primate lineages over the last 35 million years of primate evolution. The expansions are associated with independent lineage-specific segmental duplications flanking LCR16a leading to the emergence of large interspersed duplication blocks at non-orthologous chromosomal locations in each primate lineage. The intron-exon structure of the NPIP gene family has changed dramatically throughout primate evolution with different branches showing characteristic gene models yet maintaining an open reading frame. In the African ape lineage, we detect signatures of positive selection that occurred after a transition to more ubiquitous expression among great ape tissues when compared to Old World and New World monkeys. Mouse transgenic experiments from baboon and human genomic loci confirm these expression differences and suggest that the broader ape expression pattern arose due to mutational changes that emerged in cis. CONCLUSIONS: LCR16a promotes serial interspersed duplications and creates hotspots of genomic instability that appear to be an ancient property of primate genomes. Dramatic changes to NPIP gene structure and altered tissue expression preceded major bouts of positive selection in the African ape lineage, suggestive of a gene undergoing strong adaptive evolution.


Asunto(s)
Evolución Molecular , Duplicación de Gen , Primates/genética , Duplicaciones Segmentarias en el Genoma , Animales , Biodiversidad , Encéfalo , Mapeo Cromosómico , Cromosomas , Exones , Fusión Génica , Genoma Humano , Inestabilidad Genómica , Hominidae , Humanos , Filogenia
16.
Sci Rep ; 9(1): 15433, 2019 10 28.
Artículo en Inglés | MEDLINE | ID: mdl-31659207

RESUMEN

Personalized medicine approaches are increasingly sought for diseases with a heritable component. Systemic lupus erythematosus (SLE) is the prototypic autoimmune disease resulting from loss of immunologic tolerance, but the genetic basis of SLE remains incompletely understood. Genome wide association studies (GWAS) identify regions associated with disease, based on common single nucleotide polymorphisms (SNPs) within them, but these SNPs may simply be markers in linkage disequilibrium with other, causative mutations. Here we use an hierarchical screening approach for prediction and testing of true functional variants within regions identified in GWAS; this involved bioinformatic identification of putative regulatory elements within close proximity to SLE SNPs, screening those regions for potentially causative mutations by high resolution melt analysis, and functional validation using reporter assays. Using this approach, we screened 15 SLE associated loci in 143 SLE patients, identifying 7 new variants including 5 SNPs and 2 insertions. Reporter assays revealed that the 5 SNPs were functional, altering enhancer activity. One novel variant was linked to the relatively well characterized rs9888739 SNP at the ITGAM locus, and may explain some of the SLE heritability at this site. Our study demonstrates that non-coding regulatory elements can contain private sequence variants affecting gene expression, which may explain part of the heritability of SLE.


Asunto(s)
Predisposición Genética a la Enfermedad , Desequilibrio de Ligamiento , Lupus Eritematoso Sistémico/genética , Polimorfismo de Nucleótido Simple , Secuencias Reguladoras de Ácidos Nucleicos , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino
17.
Science ; 366(6463)2019 10 18.
Artículo en Inglés | MEDLINE | ID: mdl-31624180

RESUMEN

Copy number variants (CNVs) are subject to stronger selective pressure than single-nucleotide variants, but their roles in archaic introgression and adaptation have not been systematically investigated. We show that stratified CNVs are significantly associated with signatures of positive selection in Melanesians and provide evidence for adaptive introgression of large CNVs at chromosomes 16p11.2 and 8p21.3 from Denisovans and Neanderthals, respectively. Using long-read sequence data, we reconstruct the structure and complex evolutionary history of these polymorphisms and show that both encode positively selected genes absent from most human populations. Our results collectively suggest that large CNVs originating in archaic hominins and introgressed into modern humans have played an important role in local population adaptation and represent an insufficiently studied source of large-scale genetic variation.


Asunto(s)
Introgresión Genética , Animales , Duplicación Cromosómica , Cromosomas Humanos Par 16/genética , Cromosomas Humanos Par 8/genética , Variaciones en el Número de Copia de ADN , Evolución Molecular , Genoma Humano , Haplotipos , Hominidae/genética , Humanos , Melanesia , Modelos Genéticos , Hombre de Neandertal/genética , Polimorfismo Genético , Selección Genética , Secuenciación Completa del Genoma
18.
Nat Commun ; 10(1): 1784, 2019 04 16.
Artículo en Inglés | MEDLINE | ID: mdl-30992455

RESUMEN

The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios to define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (<50 bp) and 27,622 SVs (≥50 bp) per genome. We also discover 156 inversions per genome and 58 of the inversions intersect with the critical regions of recurrent microdeletion and microduplication syndromes. Taken together, our SV callsets represent a three to sevenfold increase in SV detection compared to most standard high-throughput sequencing studies, including those from the 1000 Genomes Project. The methods and the dataset presented serve as a gold standard for the scientific community allowing us to make recommendations for maximizing structural variation sensitivity for future genome sequencing studies.


Asunto(s)
Genoma Humano/genética , Variación Estructural del Genoma , Genómica/métodos , Haplotipos/genética , Algoritmos , Mapeo Cromosómico/métodos , Bases de Datos Genéticas , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Mutación INDEL , Secuenciación Completa del Genoma/métodos
19.
Science ; 360(6393)2018 06 08.
Artículo en Inglés | MEDLINE | ID: mdl-29880660

RESUMEN

Genetic studies of human evolution require high-quality contiguous ape genome assemblies that are not guided by the human reference. We coupled long-read sequence assembly and full-length complementary DNA sequencing with a multiplatform scaffolding approach to produce ab initio chimpanzee and orangutan genome assemblies. By comparing these with two long-read de novo human genome assemblies and a gorilla genome assembly, we characterized lineage-specific and shared great ape genetic variation ranging from single- to mega-base pair-sized variants. We identified ~17,000 fixed human-specific structural variants identifying genic and putative regulatory changes that have emerged in humans since divergence from nonhuman apes. Interestingly, these variants are enriched near genes that are down-regulated in human compared to chimpanzee cerebral organoids, particularly in cells analogous to radial glial neural progenitors.


Asunto(s)
Evolución Molecular , Genoma Humano , Hominidae/genética , Animales , Mapeo Contig , Variación Genética , Humanos , Anotación de Secuencia Molecular , Análisis de Secuencia de ADN
20.
Methods Mol Biol ; 1492: 95-106, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-27822858

RESUMEN

Molecular inversion probes (MIPs) in combination with massively parallel DNA sequencing represent a versatile, yet economical tool for targeted sequencing of genomic DNA. Several thousand genomic targets can be selectively captured using long oligonucleotides containing unique targeting arms and universal linkers. The ability to append sequencing adaptors and sample-specific barcodes allows large-scale pooling and subsequent high-throughput sequencing at relatively low cost per sample. Here, we describe a "wet bench" protocol detailing the capture and subsequent sequencing of >2000 genomic targets from 192 samples, representative of a single lane on the Illumina HiSeq 2000 platform.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Sondas Moleculares , Humanos , Reacción en Cadena de la Polimerasa
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA