Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 76
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Nature ; 622(7984): 784-793, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37821707

RESUMO

The Mexico City Prospective Study is a prospective cohort of more than 150,000 adults recruited two decades ago from the urban districts of Coyoacán and Iztapalapa in Mexico City1. Here we generated genotype and exome-sequencing data for all individuals and whole-genome sequencing data for 9,950 selected individuals. We describe high levels of relatedness and substantial heterogeneity in ancestry composition across individuals. Most sequenced individuals had admixed Indigenous American, European and African ancestry, with extensive admixture from Indigenous populations in central, southern and southeastern Mexico. Indigenous Mexican segments of the genome had lower levels of coding variation but an excess of homozygous loss-of-function variants compared with segments of African and European origin. We estimated ancestry-specific allele frequencies at 142 million genomic variants, with an effective sample size of 91,856 for Indigenous Mexican ancestry at exome variants, all available through a public browser. Using whole-genome sequencing, we developed an imputation reference panel that outperforms existing panels at common variants in individuals with high proportions of central, southern and southeastern Indigenous Mexican ancestry. Our work illustrates the value of genetic studies in diverse populations and provides foundational imputation and allele frequency resources for future genetic studies in Mexico and in the United States, where the Hispanic/Latino population is predominantly of Mexican descent.


Assuntos
Sequenciamento do Exoma , Genoma Humano , Genótipo , Hispânico ou Latino , Adulto , Humanos , África/etnologia , América/etnologia , Europa (Continente)/etnologia , Frequência do Gene/genética , Genética Populacional , Genoma Humano/genética , Técnicas de Genotipagem , Hispânico ou Latino/genética , Homozigoto , Mutação com Perda de Função/genética , México , Estudos Prospectivos
2.
Hum Mol Genet ; 2024 Jun 16.
Artigo em Inglês | MEDLINE | ID: mdl-38879759

RESUMO

Venous thromboembolism (VTE) is a significant contributor to morbidity and mortality, with large disparities in incidence rates between Black and White Americans. Polygenic risk scores (PRSs) limited to variants discovered in genome-wide association studies in European-ancestry samples can identify European-ancestry individuals at high risk of VTE. However, there is limited evidence on whether high-dimensional PRS constructed using more sophisticated methods and more diverse training data can enhance the predictive ability and their utility across diverse populations. We developed PRSs for VTE using summary statistics from the International Network against Venous Thrombosis (INVENT) consortium genome-wide association studies meta-analyses of European- (71 771 cases and 1 059 740 controls) and African-ancestry samples (7482 cases and 129 975 controls). We used LDpred2 and PRS-CSx to construct ancestry-specific and multi-ancestry PRSs and evaluated their performance in an independent European- (6781 cases and 103 016 controls) and African-ancestry sample (1385 cases and 12 569 controls). Multi-ancestry PRSs with weights tuned in European-ancestry samples slightly outperformed ancestry-specific PRSs in European-ancestry test samples (e.g. the area under the receiver operating curve [AUC] was 0.609 for PRS-CSx_combinedEUR and 0.608 for PRS-CSxEUR [P = 0.00029]). Multi-ancestry PRSs with weights tuned in African-ancestry samples also outperformed ancestry-specific PRSs in African-ancestry test samples (PRS-CSxAFR: AUC = 0.58, PRS-CSx_combined AFR: AUC = 0.59), although this difference was not statistically significant (P = 0.34). The highest fifth percentile of the best-performing PRS was associated with 1.9-fold and 1.68-fold increased risk for VTE among European- and African-ancestry subjects, respectively, relative to those in the middle stratum. These findings suggest that the multi-ancestry PRS might be used to improve performance across diverse populations to identify individuals at highest risk for VTE.

3.
Am J Hum Genet ; 109(9): 1582-1590, 2022 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-36055210

RESUMO

For the genomics community, allele frequencies within defined groups (or "strata") are useful across multiple research and clinical contexts. Benefits include allowing researchers to identify populations for replication or "look up" studies, enabling researchers to compare population-specific frequencies to validate findings, and facilitating assessment of variant pathogenicity in clinical contexts. However, there are potential concerns with stratified allele frequencies. These include potential re-identification (determining whether or not an individual participated in a given research study based on allele frequencies and individual-level genetic data), harm from associating stigmatizing variants with specific groups, potential reification of race as a biological rather than a socio-political category, and whether presenting stratified frequencies-and the downstream applications that this presentation enables-is consistent with participants' informed consents. The NHLBI Trans-Omics for Precision Medicine (TOPMed) program considered the scientific and social implications of different approaches for adding stratified frequencies to the TOPMed BRAVO (Browse All Variants Online) variant server. We recommend a novel approach of presenting ancestry-specific allele frequencies using a statistical method based upon local genetic ancestry inference. Notably, this approach does not require grouping individuals by either predominant global ancestry or race/ethnicity and, therefore, mitigates re-identification and other concerns as the mixture distribution of ancestral allele frequencies varies across the genome. Here we describe our considerations and approach, which can assist other genomics research programs facing similar issues of how to define and present stratified frequencies in publicly available variant databases.


Assuntos
Motivação , Medicina de Precisão , Etnicidade/genética , Frequência do Gene/genética , Genômica/métodos , Humanos
5.
Hum Mol Genet ; 31(19): 3367-3376, 2022 09 29.
Artigo em Inglês | MEDLINE | ID: mdl-34718574

RESUMO

In the era of personalized medicine with more and more patient-specific targeted therapies being used, we need reliable, dynamic, faster and sensitive biomarkers both to track the causes of disease and to develop and evolve therapies during the course of treatment. Metabolomics recently has shown substantial evidence to support its emerging role in disease diagnosis and prognosis. Aside from biomarkers and development of therapies, it is also an important goal to understand the involvement of mitochondrial DNA (mtDNA) in metabolic regulation, aging and disease development. Somatic mutations of the mitochondrial genome are also heavily implicated in age-related disease and aging. The general hypothesis is that an alteration in the concentration of metabolite profiles (possibly conveyed by lifestyle and environmental factors) influences the increase of mutation rate in the mtDNA and thereby contributes to a range of pathophysiological alterations observed in complex diseases. We performed an inverted mitochondrial genome-wide association analysis between mitochondrial nucleotide variants (mtSNVs) and concentration of metabolites. We used 151 metabolites and the whole sequenced mitochondrial genome from 2718 individuals to identify the genetic variants associated with metabolite profiles. Because of the high coverage, next-generation sequencing-based analysis of the mitochondrial genome allows for an accurate detection of mitochondrial heteroplasmy and for the identification of variants associated with the metabolome. The strongest association was found for mt715G > A located in the MT-12SrRNA with the metabolite ratio of C2/C10:1 (P-value = 6.82*10-09, ß = 0.909). The second most significant mtSNV was found for mt3714A > G located in the MT-ND1 with the metabolite ratio of phosphatidylcholine (PC) ae C42:5/PC ae C44:5 (P-value = 1.02*10-08, ß = 3.631). A large number of significant metabolite ratios were observed involving PC aa C36:6 and the variant mt10689G > A, located in the MT-ND4L gene. These results show an important interconnection between mitochondria and metabolite concentrations. Considering that some of the significant metabolites found in this study have been previously related to complex diseases, such as neurological disorders and metabolic conditions, these associations found here might play a crucial role for further investigations of such complex diseases. Understanding the mechanisms that control human health and disease, in particular, the role of genetic predispositions and their interaction with environmental factors is a prerequisite for the development of safe and efficient therapies for complex disorders.


Assuntos
Estudo de Associação Genômica Ampla , Metabolômica , Biomarcadores/metabolismo , DNA Mitocondrial/genética , DNA Mitocondrial/metabolismo , Humanos , Metabolômica/métodos , Mitocôndrias/genética , Mitocôndrias/metabolismo , Nucleotídeos/metabolismo , Fosfatidilcolinas/metabolismo
6.
Am J Hum Genet ; 108(4): 669-681, 2021 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-33730541

RESUMO

Tests of association between a phenotype and a set of genes in a biological pathway can provide insights into the genetic architecture of complex phenotypes beyond those obtained from single-variant or single-gene association analysis. However, most existing gene set tests have limited power to detect gene set-phenotype association when a small fraction of the genes are associated with the phenotype and cannot identify the potentially "active" genes that might drive a gene set-based association. To address these issues, we have developed Gene set analysis Association Using Sparse Signals (GAUSS), a method for gene set association analysis that requires only GWAS summary statistics. For each significantly associated gene set, GAUSS identifies the subset of genes that have the maximal evidence of association and can best account for the gene set association. Using pre-computed correlation structure among test statistics from a reference panel, our p value calculation is substantially faster than other permutation- or simulation-based approaches. In simulations with varying proportions of causal genes, we find that GAUSS effectively controls type 1 error rate and has greater power than several existing methods, particularly when a small proportion of genes account for the gene set signal. Using GAUSS, we analyzed UK Biobank GWAS summary statistics for 10,679 gene sets and 1,403 binary phenotypes. We found that GAUSS is scalable and identified 13,466 phenotype and gene set association pairs. Within these gene sets, we identify an average of 17.2 (max = 405) genes that underlie these gene set associations.


Assuntos
Bancos de Espécimes Biológicos , Interpretação Estatística de Dados , Bases de Dados Genéticas , Conjuntos de Dados como Assunto , Estudo de Associação Genômica Ampla/métodos , Fenótipo , Transportadores de Cassetes de Ligação de ATP/genética , Simulação por Computador , Expressão Gênica/genética , Humanos , Projetos de Pesquisa , Fatores de Tempo , Reino Unido , Navegador
7.
Proc Natl Acad Sci U S A ; 117(5): 2560-2569, 2020 02 04.
Artigo em Inglês | MEDLINE | ID: mdl-31964835

RESUMO

De novo mutations (DNMs), or mutations that appear in an individual despite not being seen in their parents, are an important source of genetic variation whose impact is relevant to studies of human evolution, genetics, and disease. Utilizing high-coverage whole-genome sequencing data as part of the Trans-Omics for Precision Medicine (TOPMed) Program, we called 93,325 single-nucleotide DNMs across 1,465 trios from an array of diverse human populations, and used them to directly estimate and analyze DNM counts, rates, and spectra. We find a significant positive correlation between local recombination rate and local DNM rate, and that DNM rate explains a substantial portion (8.98 to 34.92%, depending on the model) of the genome-wide variation in population-level genetic variation from 41K unrelated TOPMed samples. Genome-wide heterozygosity does correlate with DNM rate, but only explains <1% of variation. While we are underpowered to see small differences, we do not find significant differences in DNM rate between individuals of European, African, and Latino ancestry, nor across ancestrally distinct segments within admixed individuals. However, we did find significantly fewer DNMs in Amish individuals, even when compared with other Europeans, and even after accounting for parental age and sequencing center. Specifically, we found significant reductions in the number of C→A and T→C mutations in the Amish, which seem to underpin their overall reduction in DNMs. Finally, we calculated near-zero estimates of narrow sense heritability (h2), which suggest that variation in DNM rate is significantly shaped by nonadditive genetic effects and the environment.


Assuntos
Amish/genética , Genoma Humano , Adulto , Estudos de Coortes , Análise Mutacional de DNA , Feminino , Genética Populacional , Heterozigoto , Humanos , Masculino , Mutação , Linhagem , Sequenciamento Completo do Genoma , Adulto Jovem
8.
Mol Psychiatry ; 26(9): 5239-5250, 2021 09.
Artigo em Inglês | MEDLINE | ID: mdl-33483695

RESUMO

Bipolar disorder (BD) is a serious mental illness with substantial common variant heritability. However, the role of rare coding variation in BD is not well established. We examined the protein-coding (exonic) sequences of 3,987 unrelated individuals with BD and 5,322 controls of predominantly European ancestry across four cohorts from the Bipolar Sequencing Consortium (BSC). We assessed the burden of rare, protein-altering, single nucleotide variants classified as pathogenic or likely pathogenic (P-LP) both exome-wide and within several groups of genes with phenotypic or biologic plausibility in BD. While we observed an increased burden of rare coding P-LP variants within 165 genes identified as BD GWAS regions in 3,987 BD cases (meta-analysis OR = 1.9, 95% CI = 1.3-2.8, one-sided p = 6.0 × 10-4), this enrichment did not replicate in an additional 9,929 BD cases and 14,018 controls (OR = 0.9, one-side p = 0.70). Although BD shares common variant heritability with schizophrenia, in the BSC sample we did not observe a significant enrichment of P-LP variants in SCZ GWAS genes, in two classes of neuronal synaptic genes (RBFOX2 and FMRP) associated with SCZ or in loss-of-function intolerant genes. In this study, the largest analysis of exonic variation in BD, individuals with BD do not carry a replicable enrichment of rare P-LP variants across the exome or in any of several groups of genes with biologic plausibility. Moreover, despite a strong shared susceptibility between BD and SCZ through common genetic variation, we do not observe an association between BD risk and rare P-LP coding variants in genes known to modulate risk for SCZ.


Assuntos
Transtorno Bipolar , Esquizofrenia , Transtorno Bipolar/genética , Exoma/genética , Predisposição Genética para Doença/genética , Variação Genética/genética , Estudo de Associação Genômica Ampla , Humanos , Polimorfismo de Nucleotídeo Único/genética , Esquizofrenia/genética
9.
Proc Natl Acad Sci U S A ; 115(2): 379-384, 2018 01 09.
Artigo em Inglês | MEDLINE | ID: mdl-29279374

RESUMO

A major challenge in evaluating the contribution of rare variants to complex disease is identifying enough copies of the rare alleles to permit informative statistical analysis. To investigate the contribution of rare variants to the risk of type 2 diabetes (T2D) and related traits, we performed deep whole-genome analysis of 1,034 members of 20 large Mexican-American families with high prevalence of T2D. If rare variants of large effect accounted for much of the diabetes risk in these families, our experiment was powered to detect association. Using gene expression data on 21,677 transcripts for 643 pedigree members, we identified evidence for large-effect rare-variant cis-expression quantitative trait loci that could not be detected in population studies, validating our approach. However, we did not identify any rare variants of large effect associated with T2D, or the related traits of fasting glucose and insulin, suggesting that large-effect rare variants account for only a modest fraction of the genetic risk of these traits in this sample of families. Reliable identification of large-effect rare variants will require larger samples of extended pedigrees or different study designs that further enrich for such variants.


Assuntos
Diabetes Mellitus Tipo 2/genética , Predisposição Genética para Doença/genética , Variação Genética , Americanos Mexicanos/genética , Diabetes Mellitus Tipo 2/etnologia , Diabetes Mellitus Tipo 2/patologia , Saúde da Família , Feminino , Frequência do Gene , Predisposição Genética para Doença/etnologia , Estudo de Associação Genômica Ampla/métodos , Genótipo , Humanos , Masculino , Linhagem , Fenótipo , Locos de Características Quantitativas/genética , Sequenciamento Completo do Genoma/métodos
10.
Genome Res ; 26(4): 417-26, 2016 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-26916109

RESUMO

Although previous studies have documented a bottleneck in the transmission of mtDNA genomes from mothers to offspring, several aspects remain unclear, including the size and nature of the bottleneck. Here, we analyze the dynamics of mtDNA heteroplasmy transmission in the Genomes of the Netherlands (GoNL) data, which consists of complete mtDNA genome sequences from 228 trios, eight dizygotic (DZ) twin quartets, and 10 monozygotic (MZ) twin quartets. Using a minor allele frequency (MAF) threshold of 2%, we identified 189 heteroplasmies in the trio mothers, of which 59% were transmitted to offspring, and 159 heteroplasmies in the trio offspring, of which 70% were inherited from the mothers. MZ twin pairs exhibited greater similarity in MAF at heteroplasmic sites than DZ twin pairs, suggesting that the heteroplasmy MAF in the oocyte is the major determinant of the heteroplasmy MAF in the offspring. We used a likelihood method to estimate the effective number of mtDNA genomes transmitted to offspring under different bottleneck models; a variable bottleneck size model provided the best fit to the data, with an estimated mean of nine individual mtDNA genomes transmitted. We also found evidence for negative selection during transmission against novel heteroplasmies (in which the minor allele has never been observed in polymorphism data). These novel heteroplasmies are enhanced for tRNA and rRNA genes, and mutations associated with mtDNA diseases frequently occur in these genes. Our results thus suggest that the female germ line is able to recognize and select against deleterious heteroplasmies.


Assuntos
DNA Mitocondrial , Família , Heterogeneidade Genética , Padrões de Herança , População Branca/genética , Alelos , Feminino , Frequência do Gene , Humanos , Masculino , Modelos Genéticos , Modelos Estatísticos , Mutação , Países Baixos , Polimorfismo Genético , Seleção Genética , Gêmeos
11.
BMC Genomics ; 19(1): 845, 2018 Nov 28.
Artigo em Inglês | MEDLINE | ID: mdl-30486787

RESUMO

BACKGROUND: The spectrum of somatic single-nucleotide variants in cancer genomes often reflects the signatures of multiple distinct mutational processes, which can provide clinically actionable insights into cancer etiology. Existing software tools for identifying and evaluating these mutational signatures do not scale to analyze large datasets containing thousands of individuals or millions of variants. RESULTS: We introduce Helmsman, a program designed to perform mutation signature analysis on arbitrarily large sequencing datasets. Helmsman is up to 300 times faster than existing software. Helmsman's memory usage is independent of the number of variants, resulting in a small enough memory footprint to analyze datasets that would otherwise exceed the memory limitations of other programs. CONCLUSIONS: Helmsman is a computationally efficient tool that enables users to evaluate mutational signatures in massive sequencing datasets that are otherwise intractable with existing software. Helmsman is freely available at https://github.com/carjed/helmsman .


Assuntos
Análise Mutacional de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Software , Mutação/genética , Reprodutibilidade dos Testes
13.
Genet Epidemiol ; 39(5): 325-33, 2015 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-25966809

RESUMO

Advances in DNA sequencing technology facilitate investigating the impact of rare variants on complex diseases. However, using a conventional case-control design, large samples are needed to capture enough rare variants to achieve sufficient power for testing the association between suspected loci and complex diseases. In such large samples, population stratification may easily cause spurious signals. One approach to overcome stratification is to use a family-based design. For rare variants, this strategy is especially appropriate, as power can be increased considerably by analyzing cases with affected relatives. We propose a novel framework for association testing in affected sibpairs by comparing the allele count of rare variants on chromosome regions shared identical by descent to the allele count of rare variants on nonshared chromosome regions, referred to as test for rare variant association with family-based internal control (TRAFIC). This design is generally robust to population stratification as cases and controls are matched within each sibpair. We evaluate the power analytically using general model for effect size of rare variants. For the same number of genotyped people, TRAFIC shows superior power over the conventional case-control study for variants with summed risk allele frequency f < 0.05; this power advantage is even more substantial when considering allelic heterogeneity. For complex models of gene-gene interaction, this power advantage depends on the direction of interaction and overall heritability. In sum, we introduce a new method for analyzing rare variants in affected sibpairs that is robust to population stratification, and provide freely available software.


Assuntos
Ligação Genética , Predisposição Genética para Doença , Variação Genética/genética , Análise por Pareamento , Modelos Genéticos , Modelos Estatísticos , Software , Estudos de Casos e Controles , Simulação por Computador , Frequência do Gene , Genótipo , Humanos , Núcleo Familiar , Característica Quantitativa Herdável
14.
Am J Hum Genet ; 93(1): 90-102, 2013 Jul 11.
Artigo em Inglês | MEDLINE | ID: mdl-23746547

RESUMO

Exome sequencing offers the potential to study the population-genomic variables that underlie patterns of deleterious variation. Runs of homozygosity (ROH) are long stretches of consecutive homozygous genotypes probably reflecting segments shared identically by descent as the result of processes such as consanguinity, population size reduction, and natural selection. The relationship between ROH and patterns of predicted deleterious variation can provide insight into the way in which these processes contribute to the maintenance of deleterious variants. Here, we use exome sequencing to examine ROH in relation to the distribution of deleterious variation in 27 individuals of varying levels of apparent inbreeding from 6 human populations. A significantly greater fraction of all genome-wide predicted damaging homozygotes fall in ROH than would be expected from the corresponding fraction of nondamaging homozygotes in ROH (p < 0.001). This pattern is strongest for long ROH (p < 0.05). ROH, and especially long ROH, harbor disproportionately more deleterious homozygotes than would be expected on the basis of the total ROH coverage of the genome and the genomic distribution of nondamaging homozygotes. The results accord with a hypothesis that recent inbreeding, which generates long ROH, enables rare deleterious variants to exist in homozygous form. Thus, just as inbreeding can elevate the occurrence of rare recessive diseases that represent homozygotes for strongly deleterious mutations, inbreeding magnifies the occurrence of mildly deleterious variants as well.


Assuntos
Genética Populacional/métodos , Genoma Humano , Variação Estrutural do Genoma , Homozigoto , Alelos , Biologia Computacional/métodos , Consanguinidade , Exoma , Heterozigoto , Humanos , Mutação de Sentido Incorreto , Polimorfismo de Nucleotídeo Único , Valor Preditivo dos Testes
15.
Genome Res ; 23(12): 1974-84, 2013 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-23990608

RESUMO

Understanding patterns of spontaneous mutations is of fundamental interest in studies of human genome evolution and genetic disease. Here, we used extremely rare variants in humans to model the molecular spectrum of single-nucleotide mutations. Compared to common variants in humans and human-chimpanzee fixed differences (substitutions), rare variants, on average, arose more recently in the human lineage and are less affected by the potentially confounding effects of natural selection, population demographic history, and biased gene conversion. We analyzed variants obtained from a population-based sequencing study of 202 genes in >14,000 individuals. We observed considerable variability in the per-gene mutation rate, which was correlated with local GC content, but not recombination rate. Using >20,000 variants with a derived allele frequency ≤ 10(-4), we examined the effect of local GC content and recombination rate on individual variant subtypes and performed comparisons with common variants and substitutions. The influence of local GC content on rare variants differed from that on common variants or substitutions, and the differences varied by variant subtype. Furthermore, recombination rate and recombination hotspots have little effect on rare variants of any subtype, yet both have a relatively strong impact on multiple variant subtypes in common variants and substitutions. This observation is consistent with the effect of biased gene conversion or selection-dependent processes. Our results highlight the distinct biases inherent in the initial mutation patterns and subsequent evolutionary processes that affect segregating variants.


Assuntos
Variação Genética , Genoma Humano , Mutação Puntual , Animais , Composição de Bases , Evolução Molecular , Conversão Gênica , Frequência do Gene , Genômica , Humanos , Modelos Logísticos , Modelos Genéticos , Taxa de Mutação , Pan troglodytes/genética , Filogenia , Recombinação Genética , Seleção Genética
16.
BMC Bioinformatics ; 16: 75, 2015 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-25884587

RESUMO

BACKGROUND: Sequencing studies of exonic regions aim to identify rare variants contributing to complex traits. With high coverage and large sample size, these studies tend to apply simple variant calling algorithms. However, coverage is often heterogeneous; sites with insufficient coverage may benefit from sophisticated calling algorithms used in low-coverage sequencing studies. We evaluate the potential benefits of different calling strategies by performing a comparative analysis of variant calling methods on exonic data from 202 genes sequenced at 24x in 7,842 individuals. We call variants using individual-based, population-based and linkage disequilibrium (LD)-aware methods with stringent quality control. We measure genotype accuracy by the concordance with on-target GWAS genotypes and between 80 pairs of sequencing replicates. We validate selected singleton variants using capillary sequencing. RESULTS: Using these calling methods, we detected over 27,500 variants at the targeted exons; >57% were singletons. The singletons identified by individual-based analyses were of the highest quality. However, individual-based analyses generated more missing genotypes (4.72%) than population-based (0.47%) and LD-aware (0.17%) analyses. Moreover, individual-based genotypes were the least concordant with array-based genotypes and replicates. Population-based genotypes were less concordant than genotypes from LD-aware analyses with extended haplotypes. We reanalyzed the same dataset with a second set of callers and showed again that the individual-based caller identified more high-quality singletons than the population-based caller. We also replicated this result in a second dataset of 57 genes sequenced at 127.5x in 3,124 individuals. CONCLUSIONS: We recommend population-based analyses for high quality variant calls with few missing genotypes. With extended haplotypes, LD-aware methods generate the most accurate and complete genotypes. In addition, individual-based analyses should complement the above methods to obtain the most singleton variants.


Assuntos
Algoritmos , Biomarcadores/análise , Doença/genética , Éxons/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Polimorfismo de Nucleotídeo Único/genética , Software , Genética Populacional , Genoma Humano , Genótipo , Haplótipos/genética , Humanos , Desequilíbrio de Ligação
17.
PLoS Genet ; 8(8): e1002886, 2012 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-22927824

RESUMO

Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure.


Assuntos
Genética Populacional , Genoma Humano , Polimorfismo de Nucleotídeo Único , Emigração e Imigração , Geografia , Haplótipos , Humanos , Análise de Componente Principal
18.
Bipolar Disord ; 16(3): 277-88, 2014 May.
Artigo em Inglês | MEDLINE | ID: mdl-24330321

RESUMO

OBJECTIVE: Since a poor diet is often cited as a contributor to metabolic syndrome for subjects diagnosed with bipolar disorder and schizophrenia, we sought to examine dietary intake, cigarette smoking, and physical activity in these populations and compare them with those for the general population. METHODS: Individuals diagnosed with bipolar disorder (n = 116) and schizophrenia (n = 143) were assessed for dietary intake, lifestyle habits, and metabolic syndrome and compared to age-, gender-, and race-matched subjects from the National Health and Nutrition Examination Survey (NHANES) 1999-2000. Additionally, matched subgroups within the patient populations were compared to elicit any differences. RESULTS: As expected, the metabolic syndrome rate was higher in the samples with bipolar disorder (33%) and schizophrenia (47%) compared to matched NHANES controls (17% and 11%, respectively), and not different between the patient groups. Surprisingly, both subjects with bipolar disorder and those with schizophrenia consumed fewer total calories, carbohydrates and fats, as well as more fiber (p < 0.03), compared to NHANES controls. No dietary or activity differences between patient participants with and without metabolic syndrome were found. Subjects with schizophrenia had significantly lower total and low-density cholesterol levels (p < 0.0001) compared to NHANES controls. Subjects with bipolar disorder smoked less (p = 0.001), exercised more (p = 0.004), and had lower body mass indexes (p = 0.009) compared to subjects with schizophrenia. CONCLUSIONS: Counter to predictions, few dietary differences could be discerned between schizophrenia, bipolar disorder, and NHANES control groups. The subjects with bipolar disorder exhibited healthier behaviors than the patients with schizophrenia. Additional research regarding metabolic syndrome mechanisms, focusing on non-dietary contributions, is needed.


Assuntos
Transtorno Bipolar/complicações , Dieta , Estilo de Vida , Doenças Metabólicas/etiologia , Esquizofrenia/complicações , Adulto , Análise de Variância , Estudos de Casos e Controles , Planejamento em Saúde Comunitária , Feminino , Humanos , Masculino , Doenças Metabólicas/diagnóstico , Pessoa de Meia-Idade , Inquéritos Nutricionais , Fatores de Risco
19.
Nat Rev Genet ; 9(7): 527-40, 2008 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-18560438

RESUMO

Several psychiatric disorders--such as bipolar disorder, schizophrenia and autism--are highly heritable, yet identifying their genetic basis has been challenging, with most discoveries failing to be replicated. However, inroads have been made by the incorporation of intermediate traits (endophenotypes) and of environmental factors into genetic analyses, and through the identification of rare inherited variants and novel structural mutations. Current efforts aim to increase sample sizes by gathering larger samples for case-control studies or through meta-analyses of such studies. More attention on unique families, rare variants, and on incorporating environment and the emerging knowledge of biological function and pathways into genetic analysis is warranted.


Assuntos
Predisposição Genética para Doença , Transtornos Mentais/genética , Animais , Variação Genética , Genoma Humano , Humanos , Transtornos Mentais/diagnóstico , Fatores de Risco
20.
PLoS Genet ; 7(6): e1002134, 2011 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-21738484

RESUMO

Although a highly heritable and disabling disease, bipolar disorder's (BD) genetic variants have been challenging to identify. We present new genotype data for 1,190 cases and 401 controls and perform a genome-wide association study including additional samples for a total of 2,191 cases and 1,434 controls. We do not detect genome-wide significant associations for individual loci; however, across all SNPs, we show an association between the power to detect effects calculated from a previous genome-wide association study and evidence for replication (P = 1.5×10(-7)). To demonstrate that this result is not likely to be a false positive, we analyze replication rates in a large meta-analysis of height and show that, in a large enough study, associations replicate as a function of power, approaching a linear relationship. Within BD, SNPs near exons exhibit a greater probability of replication, supporting an enrichment of reproducible associations near functional regions of genes. These results indicate that there is likely common genetic variation associated with BD near exons (±10 kb) that could be identified in larger studies and, further, provide a framework for assessing the potential for replication when combining results from multiple studies.


Assuntos
Transtorno Bipolar/genética , Estudo de Associação Genômica Ampla , Biologia Computacional , Éxons , Genótipo , Humanos , Polimorfismo de Nucleotídeo Único/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA