Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
1.
HGG Adv ; : 100375, 2024 Oct 28.
Artículo en Inglés | MEDLINE | ID: mdl-39468794

RESUMEN

Multiple lines of evidence indicate that ankylosing spondylitis (AS) is a lymphocyte-driven disease. However, which lymphocyte populations are critical in AS pathogenesis is not known. In this study, we aimed to identify the key cell types mediating the genetic risk in AS using an unbiased functional genomics approach. We integrated genome-wide association study (GWAS) data with epigenomic and transcriptomic datasets of human immune cells. To quantify enrichment of cell type-specific open chromatin or gene expression in AS risk loci, we used three published methods - LDSC-SEG, SNPsea, scDRS - that have successfully identified relevant cell types in other diseases. Natural killer (NK) cell-specific open chromatin regions are significantly enriched in heritability for AS, compared to other immune cell types such as T cells, B cells, and monocytes. This finding was consistent between two AS GWAS. Using RNA-seq data, we validated that genes in AS risk loci are enriched in NK cell-specific gene expression. Using the human Space-Time Gut Cell Atlas, we also found significant upregulation of AS-associated genes predominantly in NK cells. We performed co-localization analyses between GWAS risk loci and genetic variants associated with gene expression (eQTL) to find putative target genes. This revealed four AS risk loci affecting regulation of candidate target genes in NK cells: two known loci, ERAP1 and TNFRSF1A, and two under-studied loci, ENTR1 (aka SDCCAG3) and B3GNT2. Our findings suggest that NK cells may play a crucial role in AS development and highlight four putative target genes for functional follow-up in NK cells.

2.
Nat Commun ; 15(1): 6804, 2024 Aug 09.
Artículo en Inglés | MEDLINE | ID: mdl-39122710

RESUMEN

Genome-wide association studies implicate multiple loci in risk for systemic lupus erythematosus (SLE), but few contain exonic variants, rendering systematic identification of non-coding variants essential to decoding SLE genetics. We utilized SNP-seq and bioinformatic enrichment to interrogate 2180 single-nucleotide polymorphisms (SNPs) from 87 SLE risk loci for potential binding of transcription factors and related proteins from B cells. 52 SNPs that passed initial screening were tested by electrophoretic mobility shift and luciferase reporter assays. To validate the approach, we studied rs2297550 in detail, finding that the risk allele enhanced binding to the transcription factor Ikaros (encoded by IKZF1), thereby modulating expression of IKBKE. Correspondingly, primary cells from genotyped healthy donors bearing the risk allele expressed higher levels of the interferon / NF-κB regulator IKKε. Together, these findings define a set of likely functional non-coding lupus risk variants and identify a regulatory pathway involving rs2297550, Ikaros, and IKKε implicated by human genetics in risk for SLE.


Asunto(s)
Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Quinasa I-kappa B , Factor de Transcripción Ikaros , Lupus Eritematoso Sistémico , Polimorfismo de Nucleótido Simple , Lupus Eritematoso Sistémico/genética , Humanos , Factor de Transcripción Ikaros/genética , Factor de Transcripción Ikaros/metabolismo , Quinasa I-kappa B/genética , Quinasa I-kappa B/metabolismo , Predisposición Genética a la Enfermedad/genética , Alelos , Linfocitos B/metabolismo , FN-kappa B/metabolismo , FN-kappa B/genética , Regulación de la Expresión Génica
3.
Cell Genom ; 4(9): 100636, 2024 Sep 11.
Artículo en Inglés | MEDLINE | ID: mdl-39197446

RESUMEN

Asthma is a complex disease caused by genetic and environmental factors. Studies show that wheezing during rhinovirus infection correlates with childhood asthma development. Over 150 non-coding risk variants for asthma have been identified, many affecting gene regulation in T cells, but the effects of most risk variants remain unknown. We hypothesized that airway epithelial cells could also mediate genetic susceptibility to asthma given they are the first line of defense against respiratory viruses and allergens. We integrated genetic data with transcriptomics of airway epithelial cells subject to different stimuli. We demonstrate that rhinovirus infection significantly upregulates childhood-onset asthma-associated genes, particularly in non-ciliated cells. This enrichment is also observed with influenza infection but not with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) or cytokine activation. Overall, our results suggest that rhinovirus infection is an environmental factor that interacts with genetic risk factors through non-ciliated airway epithelial cells to drive childhood-onset asthma.


Asunto(s)
Asma , Células Epiteliales , Predisposición Genética a la Enfermedad , Infecciones por Picornaviridae , Rhinovirus , Humanos , Asma/genética , Asma/virología , Asma/inmunología , Células Epiteliales/virología , Células Epiteliales/metabolismo , Infecciones por Picornaviridae/genética , Infecciones por Picornaviridae/inmunología , Infecciones por Picornaviridae/virología , Niño , Factores de Riesgo , SARS-CoV-2 , Gripe Humana/genética , Gripe Humana/inmunología , Gripe Humana/virología , COVID-19/genética , COVID-19/virología , COVID-19/inmunología
4.
Nat Commun ; 15(1): 2150, 2024 Mar 09.
Artículo en Inglés | MEDLINE | ID: mdl-38459032

RESUMEN

Fine-mapping and functional studies implicate rs117701653, a non-coding single nucleotide polymorphism in the CD28/CTLA4/ICOS locus, as a risk variant for rheumatoid arthritis and type 1 diabetes. Here, using DNA pulldown, mass spectrometry, genome editing and eQTL analysis, we establish that the disease-associated risk allele is functional, reducing affinity for the inhibitory chromosomal regulator SMCHD1 to enhance expression of inducible T-cell costimulator (ICOS) in memory CD4+ T cells from healthy donors. Higher ICOS expression is paralleled by an increase in circulating T peripheral helper (Tph) cells and, in rheumatoid arthritis patients, of blood and joint fluid Tph cells as well as circulating plasmablasts. Correspondingly, ICOS ligation and carriage of the rs117701653 risk allele accelerate T cell differentiation into CXCR5-PD-1high Tph cells producing IL-21 and CXCL13. Thus, mechanistic dissection of a functional non-coding variant in human autoimmunity discloses a previously undefined pathway through which ICOS regulates Tph development and abundance.


Asunto(s)
Artritis Reumatoide , Linfocitos T , Humanos , Linfocitos T/metabolismo , Proteína Coestimuladora de Linfocitos T Inducibles/metabolismo , Antígenos CD28/metabolismo , Alelos , Linfocitos T Colaboradores-Inductores , Proteínas Cromosómicas no Histona/metabolismo
5.
medRxiv ; 2024 Feb 06.
Artículo en Inglés | MEDLINE | ID: mdl-38370648

RESUMEN

Asthma is a complex disease caused by genetic and environmental factors. Epidemiological studies have shown that in children, wheezing during rhinovirus infection (a cause of the common cold) is associated with asthma development during childhood. This has led scientists to hypothesize there could be a causal relationship between rhinovirus infection and asthma or that RV-induced wheezing identifies individuals at increased risk for asthma development. However, not all children who wheeze when they have a cold develop asthma. Genome-wide association studies (GWAS) have identified hundreds of genetic variants contributing to asthma susceptibility, with the vast majority of likely causal variants being non-coding. Integrative analyses with transcriptomic and epigenomic datasets have indicated that T cells drive asthma risk, which has been supported by mouse studies. However, the datasets ascertained in these integrative analyses lack airway epithelial cells. Furthermore, large-scale transcriptomic T cell studies have not identified the regulatory effects of most non-coding risk variants in asthma GWAS, indicating there could be additional cell types harboring these "missing regulatory effects". Given that airway epithelial cells are the first line of defense against rhinovirus, we hypothesized they could be mediators of genetic susceptibility to asthma. Here we integrate GWAS data with transcriptomic datasets of airway epithelial cells subject to stimuli that could induce activation states relevant to asthma. We demonstrate that epithelial cultures infected with rhinovirus significantly upregulate childhood-onset asthma-associated genes. We show that this upregulation occurs specifically in non-ciliated epithelial cells. This enrichment for genes in asthma risk loci, or 'asthma heritability enrichment' is also significant for epithelial genes upregulated with influenza infection, but not with SARS-CoV-2 infection or cytokine activation. Additionally, cells from patients with asthma showed a stronger heritability enrichment compared to cells from healthy individuals. Overall, our results suggest that rhinovirus infection is an environmental factor that interacts with genetic risk factors through non-ciliated airway epithelial cells to drive childhood-onset asthma.

6.
bioRxiv ; 2024 Apr 12.
Artículo en Inglés | MEDLINE | ID: mdl-37645953

RESUMEN

Genome-wide association studies implicate multiple loci in risk for systemic lupus erythematosus (SLE), but few contain exonic variants, rendering systematic identification of non-coding variants essential to decoding SLE genetics. We utilized SNP-seq and bioinformatic enrichment to interrogate 2180 single-nucleotide polymorphisms (SNPs) from 87 SLE risk loci for potential binding of transcription factors and related proteins from B cells. 52 SNPs that passed initial screening were tested by electrophoretic mobility shift and luciferase reporter assays. To validate the approach, we studied rs2297550 in detail, finding that the risk allele enhanced binding to the transcription factor Ikaros (IKZF1), thereby modulating expression of IKBKE. Correspondingly, primary cells from genotyped healthy donors bearing the risk allele expressed higher levels of the interferon / NF-κB regulator IKKϵ. Together, these findings define a set of likely functional non-coding lupus risk variants and identify a new regulatory pathway involving rs2297550, Ikaros, and IKKϵ implicated by human genetics in risk for SLE.

7.
Nat Genet ; 55(12): 2255-2268, 2023 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-38036787

RESUMEN

The human leukocyte antigen (HLA) locus plays a critical role in complex traits spanning autoimmune and infectious diseases, transplantation and cancer. While coding variation in HLA genes has been extensively documented, regulatory genetic variation modulating HLA expression levels has not been comprehensively investigated. Here we mapped expression quantitative trait loci (eQTLs) for classical HLA genes across 1,073 individuals and 1,131,414 single cells from three tissues. To mitigate technical confounding, we developed scHLApers, a pipeline to accurately quantify single-cell HLA expression using personalized reference genomes. We identified cell-type-specific cis-eQTLs for every classical HLA gene. Modeling eQTLs at single-cell resolution revealed that many eQTL effects are dynamic across cell states even within a cell type. HLA-DQ genes exhibit particularly cell-state-dependent effects within myeloid, B and T cells. For example, a T cell HLA-DQA1 eQTL ( rs3104371 ) is strongest in cytotoxic cells. Dynamic HLA regulation may underlie important interindividual variability in immune responses.


Asunto(s)
Regulación de la Expresión Génica , Sitios de Carácter Cuantitativo , Humanos , Regulación de la Expresión Génica/genética , Sitios de Carácter Cuantitativo/genética , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple
8.
medRxiv ; 2023 Sep 26.
Artículo en Inglés | MEDLINE | ID: mdl-37808698

RESUMEN

Objective: Multiple lines of evidence indicate that ankylosing spondylitis (AS) is a lymphocyte-driven disease. However, which lymphocyte populations are critical in AS pathogenesis is not known. In this study, we aimed to identify the key cell types mediating the genetic risk in AS using an unbiased integrative functional genomics approach. Methods: We integrated GWAS data with epigenomic and transcriptomic datasets of immune cells in healthy humans. To quantify enrichment of cell type-specific open chromatin regions or gene expression in AS risk loci, we used three published methods which have identified cell types for other diseases. Additionally, we performed co-localization analyses between GWAS risk loci and genetic variants associated with gene expression (eQTL) to find putative target genes of AS risk variants. Results: Natural killer (NK) cell-specific open chromatin regions are significantly enriched in heritability for AS, compared to other immune cell types such as T cells, B cells, and monocytes. This finding was consistent between two AS GWAS. Using RNA-seq data, we validated that genes in AS risk loci are enriched in NK cell-specific gene expression. Expression levels of AS-associated genes, such as RUNX3, TBX21, TNFRSF1A, and NPEPPS, were found to be highest in NK cells compared to five T cell subsets. Using the human Space-Time Gut Cell Atlas we found significant upregulation of AS-associated genes predominantly in NK cells. Co-localization analysis revealed four AS risk loci affecting regulation of candidate target genes in NK cells: two known loci, ERAP1 and TNFRSF1A, and two under-studied loci, ENTR1 (aka SDCCAG3) and B3GNT2. Conclusion: Our results point to NK cells as potential key drivers in the development of AS and highlight four putative target genes for functional follow-up in NK cells.

9.
medRxiv ; 2023 Mar 20.
Artículo en Inglés | MEDLINE | ID: mdl-36993194

RESUMEN

The human leukocyte antigen (HLA) locus plays a critical role in complex traits spanning autoimmune and infectious diseases, transplantation, and cancer. While coding variation in HLA genes has been extensively documented, regulatory genetic variation modulating HLA expression levels has not been comprehensively investigated. Here, we mapped expression quantitative trait loci (eQTLs) for classical HLA genes across 1,073 individuals and 1,131,414 single cells from three tissues, using personalized reference genomes to mitigate technical confounding. We identified cell-type-specific cis-eQTLs for every classical HLA gene. Modeling eQTLs at single-cell resolution revealed that many eQTL effects are dynamic across cell states even within a cell type. HLA-DQ genes exhibit particularly cell-state-dependent effects within myeloid, B, and T cells. Dynamic HLA regulation may underlie important interindividual variability in immune responses.

10.
Immunogenetics ; 75(3): 249-262, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-36707444

RESUMEN

Human leukocyte antigen (HLA) class I and II loci are essential elements of innate and acquired immunity. Their functions include antigen presentation to T cells leading to cellular and humoral immune responses, and modulation of NK cells. Their exceptional influence on disease outcome has now been made clear by genome-wide association studies. The exons encoding the peptide-binding groove have been the main focus for determining HLA effects on disease susceptibility/pathogenesis. However, HLA expression levels have also been implicated in disease outcome, adding another dimension to the extreme diversity of HLA that impacts variability in immune responses across individuals. To estimate HLA expression, immunogenetic studies traditionally rely on quantitative PCR (qPCR). Adoption of alternative high-throughput technologies such as RNA-seq has been hampered by technical issues due to the extreme polymorphism at HLA genes. Recently, however, multiple bioinformatic methods have been developed to accurately estimate HLA expression from RNA-seq data. This opens an exciting opportunity to quantify HLA expression in large datasets but also brings questions on whether RNA-seq results are comparable to those by qPCR. In this study, we analyze three classes of expression data for HLA class I genes for a matched set of individuals: (a) RNA-seq, (b) qPCR, and (c) cell surface HLA-C expression. We observed a moderate correlation between expression estimates from qPCR and RNA-seq for HLA-A, -B, and -C (0.2 ≤ rho ≤ 0.53). We discuss technical and biological factors which need to be accounted for when comparing quantifications for different molecular phenotypes or using different techniques.


Asunto(s)
Estudio de Asociación del Genoma Completo , Antígenos de Histocompatibilidad Clase I , Humanos , RNA-Seq , Antígenos de Histocompatibilidad Clase I/genética , Antígenos HLA-C/genética , Reacción en Cadena de la Polimerasa
11.
Genet Mol Biol ; 44(1 Suppl 1): e20210036, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34436508

RESUMEN

Meeting the challenges brought by the COVID-19 pandemic requires an interdisciplinary approach. In this context, integrating knowledge of immune function with an understanding of how genetic variation influences the nature of immunity is a key challenge. Immunogenetics can help explain the heterogeneity of susceptibility and protection to the viral infection and disease progression. Here, we review the knowledge developed so far, discussing fundamental genes for triggering the innate and adaptive immune responses associated with a viral infection, especially with the SARS-CoV-2 mechanisms. We emphasize the role of the HLA and KIR genes, discussing what has been uncovered about their role in COVID-19 and addressing methodological challenges of studying these genes. Finally, we comment on questions that arise when studying admixed populations, highlighting the case of Brazil. We argue that the interplay between immunology and an understanding of genetic associations can provide an important contribution to our knowledge of COVID-19.

12.
Transfusion ; 61(3): 678-681, 2021 03.
Artículo en Inglés | MEDLINE | ID: mdl-33084042

RESUMEN

BACKGROUND: When the mother's DNA profile is not available for paternity testing, there is a smaller probability that a locus will exclude an alleged father. This study aims to evaluate the risk of potential false paternity inclusions in motherless cases. STUDY DESIGN AND METHODS: More than 20 000 duos were generated by removing the maternal genotypes from exclusion trios. After recalculating paternity in these duos, any found inclusions would be false. RESULTS: The use of an appropriate number of loci, mutation model, and mutation rates to analyze motherless paternity cases was robust against false inclusions. A single potential false inclusion was observed in a case wherein kinship plays a role. This result highlights the importance of testing the mother when available and of obtaining information on family circumstances for the proper handling of cases involving related individuals. CONCLUSION: The guidelines we used here were sufficient to avoid false inclusions in a data set of more than 20 000 motherless cases.


Asunto(s)
Sitios Genéticos , Repeticiones de Microsatélite , Alelos , Simulación por Computador , Femenino , Genotipo , Humanos , Masculino , Madres , Mutación , Paternidad , Probabilidad
13.
Front Immunol ; 11: 584950, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-33240273

RESUMEN

A match of HLA loci between patients and donors is critical for successful hematopoietic stem cell transplantation. However, the extreme polymorphism of HLA loci - an outcome of millions of years of natural selection - reduces the chances that two individuals will carry identical combinations of multilocus HLA genotypes. Further, HLA variability is not homogeneously distributed throughout the world: African populations on average have greater variability than non-Africans, reducing the chances that two unrelated African individuals are HLA identical. Here, we explore how self-identification (often equated with "ethnicity" or "race") and genetic ancestry are related to the chances of finding HLA compatible donors in a large sample from Brazil, a highly admixed country. We query REDOME, Brazil's Bone Marrow Registry, and investigate how different criteria for identifying ancestry influence the chances of finding a match. We find that individuals who self-identify as "Black" and "Mixed" on average have lower chances of finding matches than those who self-identify as "White" (up to 57% reduction). We next show that an individual's African genetic ancestry, estimated using molecular markers and quantified as the proportion of an individual's genome that traces its ancestry to Africa, is strongly associated with reduced chances of finding a match (up to 60% reduction). Finally, we document that the strongest reduction in chances of finding a match is associated with having an MHC region of exclusively African ancestry (up to 75% reduction). We apply our findings to a specific condition, for which there is a clinical indication for transplantation: sickle-cell disease. We show that the increased African ancestry in patients with this disease leads to reduced chances of finding a match, when compared to the remainder of the sample, without the condition. Our results underscore the influence of ancestry on chances of finding compatible HLA matches, and indicate that efforts guided to increasing the African component of registries are necessary.


Asunto(s)
Anemia de Células Falciformes/genética , Población Negra/genética , Médula Ósea/cirugía , Trasplante de Médula Ósea/métodos , Brasil , Etnicidad/genética , Frecuencia de los Genes/genética , Genotipo , Antígenos HLA/genética , Trasplante de Células Madre Hematopoyéticas/métodos , Prueba de Histocompatibilidad/métodos , Humanos , Polimorfismo Genético/genética , Sistema de Registros , Donante no Emparentado , Población Blanca/genética
14.
Methods Mol Biol ; 2120: 101-112, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-32124314

RESUMEN

The plethora of RNA-seq data which have been generated in the recent years constitutes an attractive resource to investigate HLA variation and its relationship with normal and disease phenotypes, such as cancer. However, next generation sequencing (NGS) brings new challenges to HLA analysis because of the mapping bias introduced by aligning short reads originated from polymorphic genes to a single reference genome. Here we describe HLApers, a pipeline which adapts widely used tools for analysis of standard RNA-seq data to infer HLA genotypes and estimate expression. By generating reliable expression estimates for each HLA allele that an individual carries, HLApers allows a better understanding of the relationship between HLA alleles and phenotypes manifested by an individual.


Asunto(s)
Antígenos HLA/genética , Prueba de Histocompatibilidad/métodos , Análisis de Secuencia de ARN/métodos , Alelos , Expresión Génica , Frecuencia de los Genes , Genotipo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Fenotipo
15.
PLoS Genet ; 15(4): e1008091, 2019 04.
Artículo en Inglés | MEDLINE | ID: mdl-31009447

RESUMEN

The HLA (Human Leukocyte Antigens) genes are well-documented targets of balancing selection, and variation at these loci is associated with many disease phenotypes. Variation in expression levels also influences disease susceptibility and resistance, but little information exists about the regulation and population-level patterns of expression. This results from the difficulty in mapping short reads originated from these highly polymorphic loci, and in accounting for the existence of several paralogues. We developed a computational pipeline to accurately estimate expression for HLA genes based on RNA-seq, improving both locus-level and allele-level estimates. First, reads are aligned to all known HLA sequences in order to infer HLA genotypes, then quantification of expression is carried out using a personalized index. We use simulations to show that expression estimates obtained in this way are not biased due to divergence from the reference genome. We applied our pipeline to the GEUVADIS dataset, and compared the quantifications to those obtained with reference transcriptome. Although the personalized pipeline recovers more reads, we found that using the reference transcriptome produces estimates similar to the personalized pipeline (r ≥ 0.87) with the exception of HLA-DQA1. We describe the impact of the HLA-personalized approach on downstream analyses for nine classical HLA loci (HLA-A, HLA-C, HLA-B, HLA-DRA, HLA-DRB1, HLA-DQA1, HLA-DQB1, HLA-DPA1, HLA-DPB1). Although the influence of the HLA-personalized approach is modest for eQTL mapping, the p-values and the causality of the eQTLs obtained are better than when the reference transcriptome is used. We investigate how the eQTLs we identified explain variation in expression among lineages of HLA alleles. Finally, we discuss possible causes underlying differences between expression estimates obtained using RNA-seq, antibody-based approaches and qPCR.


Asunto(s)
Mapeo Cromosómico , Expresión Génica , Antígenos HLA/genética , Sitios de Carácter Cuantitativo , Alelos , Biología Computacional/métodos , Frecuencia de los Genes , Genotipo , Haplotipos , Humanos , Transcriptoma
16.
Forensic Sci Int Genet ; 19: 86-91, 2015 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-26186694

RESUMEN

Large forensic databases provide an opportunity to compare observed empirical rates of genotype matching with those expected under forensic genetic models. A number of researchers have taken advantage of this opportunity to validate some forensic genetic approaches, particularly to ensure that estimated rates of genotype matching between unrelated individuals are indeed slight overestimates of those observed. However, these studies have also revealed systematic error trends in genotype probability estimates. In this analysis, we investigate these error trends and show how they result from inappropriate implementation of the Balding-Nichols model in the context of database-wide matching. Specifically, we show that in addition to accounting for increased allelic matching between individuals with recent shared ancestry, studies must account for relatively decreased allelic matching between individuals with more ancient shared ancestry.


Asunto(s)
Bases de Datos Genéticas , Genética Forense , Modelos Genéticos , Alelos , Humanos , Funciones de Verosimilitud
17.
G3 (Bethesda) ; 5(5): 931-41, 2015 Mar 17.
Artículo en Inglés | MEDLINE | ID: mdl-25787242

RESUMEN

Next-generation sequencing (NGS) technologies have become the standard for data generation in studies of population genomics, as the 1000 Genomes Project (1000G). However, these techniques are known to be problematic when applied to highly polymorphic genomic regions, such as the human leukocyte antigen (HLA) genes. Because accurate genotype calls and allele frequency estimations are crucial to population genomics analyses, it is important to assess the reliability of NGS data. Here, we evaluate the reliability of genotype calls and allele frequency estimates of the single-nucleotide polymorphisms (SNPs) reported by 1000G (phase I) at five HLA genes (HLA-A, -B, -C, -DRB1, and -DQB1). We take advantage of the availability of HLA Sanger sequencing of 930 of the 1092 1000G samples and use this as a gold standard to benchmark the 1000G data. We document that 18.6% of SNP genotype calls in HLA genes are incorrect and that allele frequencies are estimated with an error greater than ±0.1 at approximately 25% of the SNPs in HLA genes. We found a bias toward overestimation of reference allele frequency for the 1000G data, indicating mapping bias is an important cause of error in frequency estimation in this dataset. We provide a list of sites that have poor allele frequency estimates and discuss the outcomes of including those sites in different kinds of analyses. Because the HLA region is the most polymorphic in the human genome, our results provide insights into the challenges of using of NGS data at other genomic regions of high diversity.


Asunto(s)
Alelos , Mapeo Cromosómico , Frecuencia de los Genes , Genómica , Antígenos HLA/genética , Genética de Población , Genoma Humano , Genómica/métodos , Genotipo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos
18.
Forensic Sci Int Genet ; 13: e6-e12, 2014 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-24888518

RESUMEN

We have reported the allele frequencies of 15 STR loci, including the original 13 CODIS core loci, in over 100,000 Brazilian individuals. A new CODIS core loci has been proposed, but the recently established Brazilian Integrated Network of DNA Databases made a decision in 2010 to postpone the implementation of this new set of loci due to the lack of allele frequency data for the Brazilian population. We aimed to report allele frequencies of 20 loci, estimated from 96,400 Brazilian individuals undergoing paternity testing during 2011-2013. The percentage of missing data was less than 0.6% for all loci, except for CSF1PO (3.15%) and D7S820 (2.5%). The dropout rates estimated by the MicroDrop software were 0.013 for CSF1PO, 0.000037 for D7S820 and less than 0.000001 for other loci. Small missing data percentages and dropout rates reflect the high quality of the data.


Asunto(s)
Frecuencia de los Genes , Genética de Población , Repeticiones de Microsatélite , Brasil , Dermatoglifia del ADN , Bases de Datos de Ácidos Nucleicos , Humanos , Reacción en Cadena de la Polimerasa
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...