RESUMEN
pH sensing by GPR65 regulates various inflammatory conditions, but its role in skin remains unknown. In this study, we performed a phenome-wide association study and report that the T allele of GPR65-intronic single-nucleotide polymorphism rs8005161, which reduces GPR65 signaling, showed a significant association with atopic dermatitis, in addition to inflammatory bowel diseases and asthma, as previously reported. Consistent with this genetic association in humans, we show that deficiency of GPR65 in mice resulted in markedly exacerbated disease in the MC903 experimental model of atopic dermatitis. Deficiency of GPR65 also increased neutrophil migration in vitro. Moreover, GPR65 deficiency in mice resulted in higher expression of the inflammatory cytokine TNF-α by T cells. In humans, CD4+ T cells from rs8005161 heterozygous individuals expressed higher levels of TNF-α after PMA/ionomycin stimulation, particularly under pH 6 conditions. pH sensing by GPR65 appears to be important for regulating the pathogenesis of atopic dermatitis.
Asunto(s)
Dermatitis Atópica/inmunología , Protones , Animales , Movimiento Celular/inmunología , Concentración de Iones de Hidrógeno , Ratones , Ratones Endogámicos C57BL , Neutrófilos/inmunología , Receptores Acoplados a Proteínas G/análisis , Receptores Acoplados a Proteínas G/deficiencia , Receptores Acoplados a Proteínas G/inmunologíaRESUMEN
Educational attainment is strongly influenced by social and other environmental factors, but genetic factors are estimated to account for at least 20% of the variation across individuals. Here we report the results of a genome-wide association study (GWAS) for educational attainment that extends our earlier discovery sample of 101,069 individuals to 293,723 individuals, and a replication study in an independent sample of 111,349 individuals from the UK Biobank. We identify 74 genome-wide significant loci associated with the number of years of schooling completed. Single-nucleotide polymorphisms associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioural phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because educational attainment is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric diseases.
Asunto(s)
Encéfalo/metabolismo , Escolaridad , Feto/metabolismo , Regulación de la Expresión Génica/genética , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple/genética , Enfermedad de Alzheimer/genética , Trastorno Bipolar/genética , Cognición , Biología Computacional , Interacción Gen-Ambiente , Humanos , Anotación de Secuencia Molecular , Esquizofrenia/genética , Reino UnidoRESUMEN
The development of high-throughput genomic technologies has impacted many areas of genetic research. While many applications of these technologies focus on the discovery of genes involved in disease from population samples, applications of genomic technologies to an individual's genome or personal genomics have recently gained much interest. One such application is the identification of relatives from genetic data. In this application, genetic information from a set of individuals is collected in a database, and each pair of individuals is compared in order to identify genetic relatives. An inherent issue that arises in the identification of relatives is privacy. In this article, we propose a method for identifying genetic relatives without compromising privacy by taking advantage of novel cryptographic techniques customized for secure and private comparison of genetic information. We demonstrate the utility of these techniques by allowing a pair of individuals to discover whether or not they are related without compromising their genetic information or revealing it to a third party. The idea is that individuals only share enough special-purpose cryptographically protected information with each other to identify whether or not they are relatives, but not enough to expose any information about their genomes. We show in HapMap and 1000 Genomes data that our method can recover first- and second-order genetic relationships and, through simulations, show that our method can identify relationships as distant as third cousins while preserving privacy.
Asunto(s)
Privacidad Genética , Investigación Genética , Genoma Humano , Familia , Genómica , Proyecto Mapa de Haplotipos , Proyecto Genoma Humano , HumanosRESUMEN
We report genome sequences of 17 inbred strains of laboratory mice and identify almost ten times more variants than previously known. We use these genomes to explore the phylogenetic history of the laboratory mouse and to examine the functional consequences of allele-specific variation on transcript abundance, revealing that at least 12% of transcripts show a significant tissue-specific expression bias. By identifying candidate functional variants at 718 quantitative trait loci we show that the molecular nature of functional variants and their position relative to genes vary according to the effect size of the locus. These sequences provide a starting point for a new era in the functional analysis of a key model organism.
Asunto(s)
Regulación de la Expresión Génica/genética , Variación Genética/genética , Genoma/genética , Ratones Endogámicos/genética , Ratones/genética , Fenotipo , Alelos , Animales , Animales de Laboratorio/genética , Genómica , Ratones/clasificación , Ratones Endogámicos C57BL/genética , Filogenia , Sitios de Carácter Cuantitativo/genéticaRESUMEN
Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. An additional advantage of our meta-analysis approach is that our combined study has significantly higher power and improved resolution compared to any single study thus explaining the large number of loci discovered in the combined study.
Asunto(s)
HDL-Colesterol/genética , Interacción Gen-Ambiente , Sitios de Carácter Cuantitativo/genética , Animales , Ambiente , Genoma , Ratones , Modelos TeóricosRESUMEN
The use of mixed models to determine narrow-sense heritability and related quantities such as SNP heritability has received much recent attention. Less attention has been paid to the inherent variability in these estimates. One approach for quantifying variability in estimates of heritability is a frequentist approach, in which heritability is estimated using maximum likelihood and its variance is quantified through an asymptotic normal approximation. An alternative approach is to quantify the uncertainty in heritability through its Bayesian posterior distribution. In this paper, we develop the latter approach, make it computationally efficient and compare it to the frequentist approach. We show theoretically that, for a sufficiently large sample size and intermediate values of heritability, the two approaches provide similar results. Using the Atherosclerosis Risk in Communities cohort, we show empirically that the two approaches can give different results and that the variance/uncertainty can remain large.
Asunto(s)
Patrón de Herencia , Modelos Genéticos , Carácter Cuantitativo Heredable , Incertidumbre , Algoritmos , Aterosclerosis/genética , Teorema de Bayes , Estudios de Cohortes , Femenino , Genotipo , Humanos , Masculino , Fenotipo , Polimorfismo de Nucleótido Simple , Grupos Raciales/genética , Factores de RiesgoRESUMEN
Significant advances have been made in the discovery of genes affecting bone mineral density (BMD); however, our understanding of its genetic basis remains incomplete. In the current study, genome-wide association (GWA) and co-expression network analysis were used in the recently described Hybrid Mouse Diversity Panel (HMDP) to identify and functionally characterize novel BMD genes. In the HMDP, a GWA of total body, spinal, and femoral BMD revealed four significant associations (-log10P>5.39) affecting at least one BMD trait on chromosomes (Chrs.) 7, 11, 12, and 17. The associations implicated a total of 163 genes with each association harboring between 14 and 112 genes. This list was reduced to 26 functional candidates by identifying those genes that were regulated by local eQTL in bone or harbored potentially functional non-synonymous (NS) SNPs. This analysis revealed that the most significant BMD SNP on Chr. 12 was a NS SNP in the additional sex combs like-2 (Asxl2) gene that was predicted to be functional. The involvement of Asxl2 in the regulation of bone mass was confirmed by the observation that Asxl2 knockout mice had reduced BMD. To begin to unravel the mechanism through which Asxl2 influenced BMD, a gene co-expression network was created using cortical bone gene expression microarray data from the HMDP strains. Asxl2 was identified as a member of a co-expression module enriched for genes involved in the differentiation of myeloid cells. In bone, osteoclasts are bone-resorbing cells of myeloid origin, suggesting that Asxl2 may play a role in osteoclast differentiation. In agreement, the knockdown of Asxl2 in bone marrow macrophages impaired their ability to form osteoclasts. This study identifies a new regulator of BMD and osteoclastogenesis and highlights the power of GWA and systems genetics in the mouse for dissecting complex genetic traits.
Asunto(s)
Densidad Ósea/genética , Osteoclastos/citología , Osteogénesis/genética , Proteínas Represoras/genética , Proteínas Represoras/metabolismo , Alelos , Animales , Cromosomas de los Mamíferos , Perfilación de la Expresión Génica , Regulación del Desarrollo de la Expresión Génica/genética , Redes Reguladoras de Genes/genética , Estudio de Asociación del Genoma Completo , Masculino , Ratones , Ratones Noqueados , Anotación de Secuencia Molecular , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
The relationships between the levels of transcripts and the levels of the proteins they encode have not been examined comprehensively in mammals, although previous work in plants and yeast suggest a surprisingly modest correlation. We have examined this issue using a genetic approach in which natural variations were used to perturb both transcript levels and protein levels among inbred strains of mice. We quantified over 5,000 peptides and over 22,000 transcripts in livers of 97 inbred and recombinant inbred strains and focused on the 7,185 most heritable transcripts and 486 most reliable proteins. The transcript levels were quantified by microarray analysis in three replicates and the proteins were quantified by Liquid Chromatography-Mass Spectrometry using O(18)-reference-based isotope labeling approach. We show that the levels of transcripts and proteins correlate significantly for only about half of the genes tested, with an average correlation of 0.27, and the correlations of transcripts and proteins varied depending on the cellular location and biological function of the gene. We examined technical and biological factors that could contribute to the modest correlation. For example, differential splicing clearly affects the analyses for certain genes; but, based on deep sequencing, this does not substantially contribute to the overall estimate of the correlation. We also employed genome-wide association analyses to map loci controlling both transcript and protein levels. Surprisingly, little overlap was observed between the protein- and transcript-mapped loci. We have typed numerous clinically relevant traits among the strains, including adiposity, lipoprotein levels, and tissue parameters. Using correlation analysis, we found that a low number of clinical trait relationships are preserved between the protein and mRNA gene products and that the majority of such relationships are specific to either the protein levels or transcript levels. Surprisingly, transcript levels were more strongly correlated with clinical traits than protein levels. In light of the widespread use of high-throughput technologies in both clinical and basic research, the results presented have practical as well as basic implications.
Asunto(s)
Perfilación de la Expresión Génica , Variación Genética , Proteoma/análisis , Empalme Alternativo , Animales , Estudio de Asociación del Genoma Completo , Ratones , Proteoma/genética , Proteómica , ARN Mensajero/metabolismoRESUMEN
The Plasmodium falciparum parasite's ability to adapt to environmental pressures, such as the human immune system and antimalarial drugs, makes malaria an enduring burden to public health. Understanding the genetic basis of these adaptations is critical to intervening successfully against malaria. To that end, we created a high-density genotyping array that assays over 17,000 single nucleotide polymorphisms (â¼ 1 SNP/kb), and applied it to 57 culture-adapted parasites from three continents. We characterized genome-wide genetic diversity within and between populations and identified numerous loci with signals of natural selection, suggesting their role in recent adaptation. In addition, we performed a genome-wide association study (GWAS), searching for loci correlated with resistance to thirteen antimalarials; we detected both known and novel resistance loci, including a new halofantrine resistance locus, PF10_0355. Through functional testing we demonstrated that PF10_0355 overexpression decreases sensitivity to halofantrine, mefloquine, and lumefantrine, but not to structurally unrelated antimalarials, and that increased gene copy number mediates resistance. Our GWAS and follow-on functional validation demonstrate the potential of genome-wide studies to elucidate functionally important loci in the malaria parasite genome.
Asunto(s)
Antimaláricos/farmacología , Resistencia a Medicamentos/genética , Sitios Genéticos , Plasmodium falciparum/genética , Etanolaminas/farmacología , Fluorenos/farmacología , Dosificación de Gen , Expresión Génica , Estudios de Asociación Genética , Variación Genética , Genotipo , Haplotipos , Desequilibrio de Ligamiento , Lumefantrina , Malaria Falciparum/parasitología , Malaria Falciparum/prevención & control , Mefloquina/farmacología , Fenantrenos/farmacología , Plasmodium falciparum/efectos de los fármacos , Polimorfismo de Nucleótido Simple , Selección GenéticaRESUMEN
Restless legs syndrome (RLS) affects up to 10% of older adults. Their healthcare is impeded by delayed diagnosis and insufficient treatment. To advance disease prediction and find new entry points for therapy, we performed meta-analyses of genome-wide association studies in 116,647 individuals with RLS (cases) and 1,546,466 controls of European ancestry. The pooled analysis increased the number of risk loci eightfold to 164, including three on chromosome X. Sex-specific meta-analyses revealed largely overlapping genetic predispositions of the sexes (rg = 0.96). Locus annotation prioritized druggable genes such as glutamate receptors 1 and 4, and Mendelian randomization indicated RLS as a causal risk factor for diabetes. Machine learning approaches combining genetic and nongenetic information performed best in risk prediction (area under the curve (AUC) = 0.82-0.91). In summary, we identified targets for drug development and repurposing, prioritized potential causal relationships between RLS and relevant comorbidities and risk factors for follow-up and provided evidence that nonlinear interactions are likely relevant to RLS risk prediction.
Asunto(s)
Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Síndrome de las Piernas Inquietas , Síndrome de las Piernas Inquietas/genética , Humanos , Factores de Riesgo , Femenino , Masculino , Polimorfismo de Nucleótido Simple , Análisis de la Aleatorización Mendeliana , Aprendizaje AutomáticoRESUMEN
Many genome-wide association studies have been performed on population cohorts that contain phenotype measurements at multiple time points. However, standard association methodologies only consider one time point. In this paper, we propose a mixed-model-based approach for performing association mapping which utilizes multiple phenotype measurements for each individual. We introduce an analytical approach to calculate statistical power and show that this model leads to increased power when compared to traditional approaches. Moreover, we show that by using this model we are able to differentiate the genetic, environmental, and residual error contributions to the phenotype. Using predictions of these components, we show how the proportion of the phenotype due to environment and genetics can be predicted and show that the ranking of individuals based on these predictions is very accurate. The software implementing this method may be found at http://genetics.cs.ucla.edu/longGWAS/.
Asunto(s)
Mapeo Cromosómico/métodos , Algoritmos , Estudios de Cohortes , Simulación por Computador , Estudio de Asociación del Genoma Completo , Humanos , Modelos Genéticos , Modelos Estadísticos , Fenotipo , Reproducibilidad de los Resultados , Programas InformáticosRESUMEN
Systems genetics relies on common genetic variants to elucidate biologic networks contributing to complex disease-related phenotypes. Mice are ideal model organisms for such approaches, but linkage analysis has been only modestly successful due to low mapping resolution. Association analysis in mice has the potential of much better resolution, but it is confounded by population structure and inadequate power to map traits that explain less than 10% of the variance, typical of mouse quantitative trait loci (QTL). We report a novel strategy for association mapping that combines classic inbred strains for mapping resolution and recombinant inbred strains for mapping power. Using a mixed model algorithm to correct for population structure, we validate the approach by mapping over 2500 cis-expression QTL with a resolution an order of magnitude narrower than traditional QTL analysis. We also report the fine mapping of metabolic traits such as plasma lipids. This resource, termed the Hybrid Mouse Diversity Panel, makes possible the integration of multiple data sets and should prove useful for systems-based approaches to complex traits and studies of gene-by-environment interactions.
Asunto(s)
Mapeo Cromosómico/métodos , Estudio de Asociación del Genoma Completo/métodos , Sitios de Carácter Cuantitativo/genética , Algoritmos , Animales , Ligamiento Genético , Lipoproteínas HDL/genética , Masculino , Ratones , Ratones Endogámicos , FenotipoRESUMEN
MOTIVATION: Structural variations and in particular copy number variations (CNVs) have dramatic effects of disease and traits. Technologies for identifying CNVs have been an active area of research for over 10 years. The current generation of high-throughput sequencing techniques presents new opportunities for identification of CNVs. Methods that utilize these technologies map sequencing reads to a reference genome and look for signatures which might indicate the presence of a CNV. These methods work well when CNVs lie within unique genomic regions. However, the problem of CNV identification and reconstruction becomes much more challenging when CNVs are in repeat-rich regions, due to the multiple mapping positions of the reads. RESULTS: In this study, we propose an efficient algorithm to handle these multi-mapping reads such that the CNVs can be reconstructed with high accuracy even for repeat-rich regions. To our knowledge, this is the first attempt to both identify and reconstruct CNVs in repeat-rich regions. Our experiments show that our method is not only computationally efficient but also accurate.
Asunto(s)
Algoritmos , Variaciones en el Número de Copia de ADN , Secuencias Repetitivas de Ácidos Nucleicos , Animales , Mapeo Cromosómico , ADN/química , Genómica/métodos , RatonesRESUMEN
MOTIVATION: The analysis of gene coexpression is at the core of many types of genetic analysis. The coexpression between two genes can be calculated by using a traditional Pearson's correlation coefficient. However, unobserved confounding effects may cause inflation of the Pearson's correlation so that uncorrelated genes appear correlated. Many general methods have been suggested, which aim to remove the effects of confounding from gene expression data. However, the residual confounding which is not accounted for by these generic correction procedures has the potential to induce correlation between genes. Therefore, a method that specifically aims to calculate gene coexpression between gene expression arrays, while accounting for confounding effects, is desirable. RESULTS: In this article, we present a statistical model for calculating gene coexpression called mixed model coexpression (MMC), which models coexpression within a mixed model framework. Confounding effects are expected to be encoded in the matrix representing the correlation between arrays, the inter-sample correlation matrix. By conditioning on the information in the inter-sample correlation matrix, MMC is able to produce gene coexpressions that are not influenced by global confounding effects and thus significantly reduce the number of spurious coexpressions observed. We applied MMC to both human and yeast datasets and show it is better able to effectively prioritize strong coexpressions when compared to a traditional Pearson's correlation and a Pearson's correlation applied to data corrected with surrogate variable analysis (SVA). AVAILABILITY: The method is implemented in the R programming language and may be found at http://genetics.cs.ucla.edu/mmc. CONTACT: nfurlott@cs.ucla.edu; eeskin@cs.ucla.edu.
Asunto(s)
Expresión Génica , Modelos Estadísticos , Perfilación de la Expresión Génica/métodos , Regulación de la Expresión Génica de las Plantas , Humanos , Levaduras/genéticaRESUMEN
Genome-wide association studies (GWASs) examine the association between genotype and phenotype while adjusting for a set of covariates. Although the covariates may have non-linear or interactive effects, due to the challenge of specifying the model, GWAS often neglect such terms. Here we introduce DeepNull, a method that identifies and adjusts for non-linear and interactive covariate effects using a deep neural network. In analyses of simulated and real data, we demonstrate that DeepNull maintains tight control of the type I error while increasing statistical power by up to 20% in the presence of non-linear and interactive effects. Moreover, in the absence of such effects, DeepNull incurs no loss of power. When applied to 10 phenotypes from the UK Biobank (n = 370K), DeepNull discovered more hits (+6%) and loci (+7%), on average, than conventional association analyses, many of which are biologically plausible or have previously been reported. Finally, DeepNull improves upon linear modeling for phenotypic prediction (+23% on average).
Asunto(s)
Estudio de Asociación del Genoma Completo/métodos , Fenotipo , Simulación por Computador , Modelos Lineales , Proyectos de InvestigaciónRESUMEN
Polygenic risk prediction is a widely investigated topic because of its promising clinical applications. Genetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, including coding, conserved, regulatory, and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. We applied LDpred-funct to predict 21 highly heritable traits in the UK Biobank (avg N = 373 K as training data). LDpred-funct attained a +4.6% relative improvement in average prediction accuracy (avg prediction R2 = 0.144; highest R2 = 0.413 for height) compared to SBayesR (the best method that does not incorporate functional information). For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (N = 1107 K) increased prediction R2 to 0.431. Our results show that incorporating functional priors improves polygenic prediction accuracy, consistent with the functional architecture of complex traits.
Asunto(s)
Bancos de Muestras Biológicas , Herencia Multifactorial , Genoma , Genotipo , Humanos , Modelos Genéticos , Fenotipo , Polimorfismo de Nucleótido Simple , Reino UnidoRESUMEN
We trained and validated risk prediction models for the three major types of skin cancer- basal cell carcinoma (BCC), squamous cell carcinoma (SCC), and melanoma-on a cross-sectional and longitudinal dataset of 210,000 consented research participants who responded to an online survey covering personal and family history of skin cancer, skin susceptibility, and UV exposure. We developed a primary disease risk score (DRS) that combined all 32 identified genetic and non-genetic risk factors. Top percentile DRS was associated with an up to 13-fold increase (odds ratio per standard deviation increase >2.5) in the risk of developing skin cancer relative to the middle DRS percentile. To derive lifetime risk trajectories for the three skin cancers, we developed a second and age independent disease score, called DRSA. Using incident cases, we demonstrated that DRSA could be used in early detection programs for identifying high risk asymptotic individuals, and predicting when they are likely to develop skin cancer. High DRSA scores were not only associated with earlier disease diagnosis (by up to 14 years), but also with more severe and recurrent forms of skin cancer.
Asunto(s)
Carcinoma Basocelular/epidemiología , Carcinoma de Células Escamosas/epidemiología , Melanoma/epidemiología , Modelos Estadísticos , Recurrencia Local de Neoplasia/epidemiología , Neoplasias Cutáneas/epidemiología , Adulto , Anciano , Anciano de 80 o más Años , Carcinoma Basocelular/etiología , Carcinoma Basocelular/patología , Carcinoma de Células Escamosas/etiología , Estudios Transversales , Conjuntos de Datos como Asunto , Pruebas Dirigidas al Consumidor/estadística & datos numéricos , Femenino , Estudios de Seguimiento , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Incidencia , Estudios Longitudinales , Masculino , Anamnesis , Melanoma/etiología , Melanoma/patología , Persona de Mediana Edad , Recurrencia Local de Neoplasia/etiología , Recurrencia Local de Neoplasia/patología , Oportunidad Relativa , Estudios Prospectivos , Medición de Riesgo/métodos , Factores de Riesgo , Piel/patología , Piel/efectos de la radiación , Neoplasias Cutáneas/etiología , Neoplasias Cutáneas/patología , Encuestas y Cuestionarios/estadística & datos numéricos , Rayos Ultravioleta/efectos adversos , Población Blanca/genéticaRESUMEN
Human eye color is highly heritable, but its genetic architecture is not yet fully understood. We report the results of the largest genome-wide association study for eye color to date, involving up to 192,986 European participants from 10 populations. We identify 124 independent associations arising from 61 discrete genomic regions, including 50 previously unidentified. We find evidence for genes involved in melanin pigmentation, but we also find associations with genes involved in iris morphology and structure. Further analyses in 1636 Asian participants from two populations suggest that iris pigmentation variation in Asians is genetically similar to Europeans, albeit with smaller effect sizes. Our findings collectively explain 53.2% (95% confidence interval, 45.4 to 61.0%) of eye color variation using common single-nucleotide polymorphisms. Overall, our study outcomes demonstrate that the genetic complexity of human eye color considerably exceeds previous knowledge and expectations, highlighting eye color as a genetically highly complex human trait.
RESUMEN
BACKGROUND: The characterization of structural variations (SV) such as insertions, deletions and copy number variations is a critical step in the process of understanding the full genetic architecture of organisms. Copy number variations (CNV) have attracted much recent attention due to their effects on gene expression and disease status. RESULTS: In this paper, we present a method that utilizes next-generation sequencing technologies (NGS), in order to both detect and reconstruct CNVs. We focus on a special type of CNV, namely tandemly organized de novo CNVs, which have been shown to occur with high frequency in the mouse genome. CONCLUSIONS: We apply our method to CNV regions randomly inserted into the reference mouse genome and show that our method achieves good performance for both detection and reconstruction of tandemly organized de novo CNVs.
Asunto(s)
Variaciones en el Número de Copia de ADN , Análisis de Secuencia de ADN , Algoritmos , Animales , Secuenciación de Nucleótidos de Alto Rendimiento , RatonesRESUMEN
BACKGROUND: Clinical genetic testing for inherited predisposition to venous thromboembolism (VTE) is common among patients and their families. However, there is incomplete consensus about which individuals should receive testing, and the relative risks and benefits. METHODS: We assessed outcomes of receiving direct-to-consumer (DTC) results for the two most common genetic risk factors for VTE, factor V Leiden in the F5 gene (FVL) and prothrombin 20210G>A in the F2 gene (PT). Two thousand three hundred fifty-four customers (1244 variant-positive and 1110 variant-negative individuals) of the personal genetics company 23andMe, Inc., who had received results online for F5 and F2 variants, participated in an online survey-based study. Participants responded to questions about perception of VTE risk, discussion of results with healthcare providers (HCPs) and recommendations received, actions taken to control risk, emotional responses to receiving risk results, and perceived value of the information. RESULTS: Most participants (90% of variant-positive individuals, 99% of variant-negative individuals) had not previously been tested for F5 and/or F2 variants. The majority of variant-positive individuals correctly perceived that they were at higher than average risk for developing VTE. These individuals reported moderate rates of discussing results with HCPs (41%); receiving prevention advice from HCPs (31%), and making behavioral changes to control risk (e.g., exercising more, 30%). A minority (36%) of variant-positive individuals worried more after receiving VTE results. Nevertheless, most participants reported that knowing their risk had been an advantage (78% variant-positive and 58% variant-negative) and were satisfied knowing their genetic probability for VTE (81% variant-positive and 67% variant-negative). CONCLUSION: Consumers reported moderate rates of behavioral change and perceived personal benefit from receiving DTC genetic results for VTE risk.