Results 1 - 20 of 47
1.
Phys Chem Chem Phys ; 21(26): 14005-14011, 2019 Jul 03.
Article in English | MEDLINE | ID: mdl-30620013

ABSTRACT

Low-temperature reactions between laser-cooled Be⁺(²S₁/₂) ions and partially deuterated water (HOD) molecules have been investigated using an ion trap and interpreted with zero-point corrected quasi-classical trajectory calculations on a highly accurate global potential energy surface for the ground electronic state. Both product channels have been observed for the first time, and the branching to BeOD⁺ + H is found to be 0.58 ± 0.14. The experimental observation is reproduced by both quasi-classical trajectory and statistical calculations. Theoretical analyses reveal that the branching to the two product channels is largely due to the availability of open states in each channel.

2.
Genet Epidemiol ; 41(8): 756-768, 2017 Dec.
Article in English | MEDLINE | ID: mdl-28875524

ABSTRACT

A genome-wide association study (GWAS) correlates marker and trait variation in a study sample. Each subject is genotyped at a multitude of SNPs (single nucleotide polymorphisms) spanning the genome. Here, we assume that subjects are randomly collected unrelated individuals and that trait values are normally distributed or can be transformed to normality. Over the past decade, geneticists have been remarkably successful in applying GWAS analysis to hundreds of traits. The massive amount of data produced in these studies presents unique computational challenges. Penalized regression with the ℓ1 penalty (LASSO) or the minimax concave penalty (MCP) is capable of selecting a handful of associated SNPs from millions of potential SNPs. Unfortunately, model selection can be corrupted by false positives and false negatives, obscuring the genetic underpinning of a trait. Here, we compare LASSO and MCP penalized regression to iterative hard thresholding (IHT). On GWAS regression data, IHT is better at model selection and comparable in speed to both methods of penalized regression. This conclusion holds for both simulated and real GWAS data. IHT fosters parallelization and scales well in problems with large numbers of causal markers. Our parallel implementation of IHT accommodates SNP genotype compression and exploits multiple CPU cores and graphics processing units (GPUs). This allows statistical geneticists to leverage commodity desktop computers in GWAS analysis and to avoid supercomputing. AVAILABILITY: Source code is freely available at https://github.com/klkeys/IHT.jl.


Subjects
Genome-Wide Association Study , Models, Genetic , Algorithms , Body Mass Index , Cholesterol, HDL/genetics , Cholesterol, LDL/genetics , Humans , Phenotype , Polymorphism, Single Nucleotide , Triglycerides/genetics
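
The iterative hard thresholding idea named in the abstract above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the IHT.jl implementation: it assumes a least-squares loss, a fixed conservative step size, and a sparsity level k chosen in advance. The function names `hard_threshold` and `iht` and the toy design matrix are hypothetical.

```python
def hard_threshold(b, k):
    """Keep the k largest-magnitude entries of b; set the rest to zero."""
    order = sorted(range(len(b)), key=lambda j: abs(b[j]), reverse=True)
    keep = set(order[:k])
    return [b[j] if j in keep else 0.0 for j in range(len(b))]


def iht(X, y, k, iters=300):
    """Projected gradient descent for 0.5*||y - Xb||^2 with at most k nonzeros."""
    n, p = len(X), len(X[0])
    # Assumption: fixed step 1 / ||X||_F^2, which is never larger than
    # 1 / lambda_max(X^T X), so every gradient step is safely short.
    step = 1.0 / sum(x * x for row in X for x in row)
    b = [0.0] * p
    for _ in range(iters):
        resid = [y[i] - sum(X[i][j] * b[j] for j in range(p)) for i in range(n)]
        # X^T resid is the negative gradient of 0.5*||y - Xb||^2.
        direction = [sum(X[i][j] * resid[i] for i in range(n)) for j in range(p)]
        b = hard_threshold([b[j] + step * direction[j] for j in range(p)], k)
    return b


# Toy data (hypothetical): 8 samples, 5 orthogonal +/-1 predictors, 2 causal.
X = [[(-1) ** bin(i & j).count("1") for j in range(1, 6)] for i in range(8)]
beta_true = [0.0, 2.0, 0.0, 0.0, -1.0]
y = [sum(xij * bj for xij, bj in zip(row, beta_true)) for row in X]
beta_hat = iht(X, y, k=2)
```

Unlike a LASSO or MCP fit, the sparsity here is enforced directly by the projection step rather than by a penalty term, which is the contrast the abstract draws. A practical implementation would typically adapt the step size and choose k by cross-validation.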
3.
Bioinformatics ; 32(15): 2364-5, 2016 Aug 01.
Article in English | MEDLINE | ID: mdl-27153715

ABSTRACT

MOTIVATION: The challenges of successfully applying causal inference methods include: (i) satisfying underlying assumptions, (ii) limitations in data/models accommodated by the software and (iii) low power of common multiple testing approaches. RESULTS: The causal inference test (CIT) is based on hypothesis testing rather than estimation, allowing the testable assumptions to be evaluated in the determination of statistical significance. A user-friendly software package provides P-values and optionally permutation-based FDR estimates (q-values) for potential mediators. It can handle single and multiple binary and continuous instrumental variables, binary or continuous outcome variables and adjustment covariates. Also, the permutation-based FDR option provides a non-parametric implementation. CONCLUSION: Simulation studies demonstrate the validity of the cit package and show a substantial advantage of permutation-based FDR over other common multiple testing strategies. AVAILABILITY AND IMPLEMENTATION: The cit open-source R package is freely available from the CRAN website (https://cran.r-project.org/web/packages/cit/index.html) with embedded C++ code that utilizes the GNU Scientific Library, also freely available (http://www.gnu.org/software/gsl/). CONTACT: joshua.millstein@usc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Genomics , Software , Gene Library , Genome , Models, Theoretical
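
To make the notion of a permutation-based FDR estimate in the abstract above concrete, the sketch below uses a generic plug-in estimator: the average number of permuted (null) statistics that pass a threshold, divided by the number of observed statistics that pass it. This is an assumption for illustration only; the estimator actually used by the cit package may differ, and the function name `permutation_fdr` and all numbers are hypothetical.

```python
def permutation_fdr(observed, permuted, threshold):
    """Estimate FDR at `threshold` as (mean null hits) / (observed hits)."""
    hits = sum(1 for s in observed if s >= threshold)
    if hits == 0:
        return 0.0  # nothing is called significant, so no false discoveries
    null_hits = [sum(1 for s in perm if s >= threshold) for perm in permuted]
    expected_false = sum(null_hits) / len(permuted)
    return min(1.0, expected_false / hits)  # an FDR cannot exceed 1


# Hypothetical test statistics: one observed set and two permuted sets.
observed = [5.0, 4.0, 1.0, 0.5]
permuted = [[1.2, 0.3, 4.5, 0.1], [0.2, 0.9, 1.1, 0.4]]
fdr_at_4 = permutation_fdr(observed, permuted, threshold=4.0)
```

Because the null distribution comes from the permutations themselves, no parametric form is assumed for the test statistics, which is the sense in which such an estimate is non-parametric.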
4.
Nature ; 476(7359): 170-5, 2011 Jul 20.
Article in English | MEDLINE | ID: mdl-21775986

ABSTRACT

Recombination, together with mutation, gives rise to genetic variation in populations. Here we leverage the recent mixture of people of African and European ancestry in the Americas to build a genetic map measuring the probability of crossing over at each position in the genome, based on about 2.1 million crossovers in 30,000 unrelated African Americans. At intervals of more than three megabases it is nearly identical to a map built in Europeans. At finer scales it differs significantly, and we identify about 2,500 recombination hotspots that are active in people of West African ancestry but nearly inactive in Europeans. The probability of a crossover at these hotspots is almost fully controlled by the alleles an individual carries at PRDM9 (P value < 10^-245). We identify a 17-base-pair DNA sequence motif that is enriched in these hotspots, and is an excellent match to the predicted binding target of PRDM9 alleles common in West Africans and rare in Europeans. Sites of this motif are predicted to be risk loci for disease-causing genomic rearrangements in individuals carrying these alleles. More generally, this map provides a resource for research in human genetic variation and evolution.


Subjects
Black or African American/genetics , Crossing Over, Genetic/genetics , Genome, Human/genetics , Africa, Western/ethnology , Alleles , Amino Acid Motifs , Base Sequence , Chromosome Mapping , Europe/ethnology , Evolution, Molecular , Female , Gene Frequency , Genetics, Population , Genomics , Haplotypes/genetics , Histone-Lysine N-Methyltransferase/chemistry , Histone-Lysine N-Methyltransferase/genetics , Histone-Lysine N-Methyltransferase/metabolism , Humans , Male , Molecular Sequence Data , Pedigree , Polymorphism, Single Nucleotide/genetics , Probability , White People/genetics
5.
Hum Mol Genet ; 23(20): 5518-26, 2014 Oct 15.
Article in English | MEDLINE | ID: mdl-24852375

ABSTRACT

Genome-wide association studies have identified 73 breast cancer risk variants mainly in European populations. Given considerable differences in linkage disequilibrium structure between populations of European and African ancestry, the known risk variants may not be informative for risk in African ancestry populations. In a previous fine-mapping investigation of 19 breast cancer loci, we were able to identify SNPs in four regions that better captured risk associations in African American women. In this study of breast cancer in African American women (3016 cases, 2745 controls), we tested an additional 54 novel breast cancer risk variants. Thirty-eight variants (70%) were found to have an association with breast cancer in the same direction as previously reported, with eight (15%) replicating at P < 0.05. Through fine-mapping, in three regions (1q32, 3p24, 10q25), we identified variants that better captured associations with overall breast cancer or estrogen receptor positive disease. We also observed suggestive associations with variants (at P < 5 × 10^-6) in three separate regions (6q25, 14q13, 22q12) that may represent novel risk variants. Directional consistency of association observed for approximately 65-70% of currently known genetic variants for breast cancer in women of African ancestry implies a shared functional common variant at most loci. To validate and enhance the spectrum of alleles that define associations at the known breast cancer risk loci, as well as genome-wide, will require even larger collaborative efforts in women of African ancestry.


Subjects
Black or African American/genetics , Breast Neoplasms/genetics , Genetic Predisposition to Disease , Female , Genetic Loci , Genetic Variation , Genome-Wide Association Study , Humans , Polymorphism, Single Nucleotide , Receptors, Estrogen/genetics
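
The directional-consistency figure in the abstract above (38 of 54 variants, about 70%) can be put in context with a one-sided sign test against the chance expectation of 50%. The abstract does not say that such a test was run; the sketch below is only an illustration, it treats the variants as independent, and the function name `sign_test_upper_tail` is hypothetical.

```python
from math import comb


def sign_test_upper_tail(successes, trials):
    """P(X >= successes) for X ~ Binomial(trials, 0.5)."""
    tail = sum(comb(trials, k) for k in range(successes, trials + 1))
    return tail / 2 ** trials


consistent, tested = 38, 54   # counts reported in the abstract
share = consistent / tested   # fraction of variants with a consistent direction
p_one_sided = sign_test_upper_tail(consistent, tested)
```

A small tail probability here would indicate that the observed consistency is unlikely under purely random effect directions, although correlated variants would weaken that reading.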
6.
Hum Mol Genet ; 23(12): 3327-42, 2014 Jun 15.
Article in English | MEDLINE | ID: mdl-24493794

ABSTRACT

Age at menopause marks the end of a woman's reproductive life and its timing associates with risks for cancer, cardiovascular and bone disorders. GWAS and candidate gene studies conducted in women of European ancestry have identified 27 loci associated with age at menopause. The relevance of these loci to women of African ancestry has not been previously studied. We therefore sought to uncover additional menopause loci and investigate the relevance of European menopause loci by performing a GWAS meta-analysis in 6510 women with African ancestry derived from 11 studies across the USA. We did not identify any additional loci significantly associated with age at menopause in African Americans. We replicated the associations between six loci and age at menopause (P-value < 0.05): AMHR2, RHBLD2, PRIM1, HK3/UMC1, BRSK1/TMEM150B and MCM8. In addition, associations of 14 loci are directionally consistent with previous reports. We provide evidence that genetic variants influencing reproductive traits identified in European populations are also important in women of African ancestry residing in the USA.


Subjects
Black or African American/genetics , Menopause/ethnology , Menopause/genetics , White People/genetics , Age Factors , Chromosomes, Human , Female , Genetic Loci , Genetic Variation , Genome-Wide Association Study , Humans , United States
7.
Hum Genet ; 135(8): 869-80, 2016 Aug.
Article in English | MEDLINE | ID: mdl-27193597

ABSTRACT

Relative to European Americans, type 2 diabetes (T2D) is more prevalent in African Americans (AAs). Genetic variation may modulate transcript abundance in insulin-responsive tissues and contribute to risk; yet, published studies identifying expression quantitative trait loci (eQTLs) in African ancestry populations are restricted to blood cells. This study aims to develop a map of genetically regulated transcripts expressed in tissues important for glucose homeostasis in AAs, critical for identifying the genetic etiology of T2D and related traits. Quantitative measures of adipose and muscle gene expression, and genotypic data were integrated in 260 non-diabetic AAs to identify expression regulatory variants. Their roles in genetic susceptibility to T2D, and related metabolic phenotypes, were evaluated by mining GWAS datasets. eQTL analysis identified 1971 and 2078 cis-eGenes in adipose and muscle, respectively. Cis-eQTLs for 885 transcripts including top cis-eGenes CHURC1, USMG5, and ERAP2 were identified in both tissues. 62.1% of top cis-eSNPs were within ±50 kb of transcription start sites and cis-eGenes were enriched for mitochondrial transcripts. Mining GWAS databases revealed association of cis-eSNPs for more than 50 genes with T2D (e.g. PIK3C2A, RBMS1, UFSP1), gluco-metabolic phenotypes (e.g. INPP5E, SNX17, ERAP2, FN3KRP), and obesity (e.g. POMC, CPEB4). Integration of GWAS meta-analysis data from AA cohorts revealed the most significant association for cis-eSNPs of ATP5SL and MCCC1 genes, with T2D and BMI, respectively. This study developed the first comprehensive map of adipose and muscle tissue eQTLs in AAs (publicly accessible at https://mdsetaa.phs.wakehealth.edu) and identified genetically regulated transcripts for delineating genetic causes of T2D, and related metabolic phenotypes.


Subjects
Adipose Tissue/metabolism , Diabetes Mellitus, Type 2/genetics , Muscles/metabolism , Obesity/genetics , Quantitative Trait Loci/genetics , Adipose Tissue/pathology , Adolescent , Adult , Black or African American/genetics , Chromosome Mapping , Diabetes Mellitus, Type 2/pathology , Female , Gene Expression Regulation , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Male , Middle Aged , Muscles/pathology , Obesity/pathology
8.
Genome Res ; 23(3): 509-18, 2013 Mar.
Article in English | MEDLINE | ID: mdl-23233546

ABSTRACT

Most current genotype imputation methods are model-based and computationally intensive, taking days to impute one chromosome pair on 1000 people. We describe an efficient genotype imputation method based on matrix completion. Our matrix completion method is implemented in MATLAB and tested on real data from HapMap 3, simulated pedigree data, and simulated low-coverage sequencing data derived from the 1000 Genomes Project. Compared with leading imputation programs, the matrix completion algorithm embodied in our program MENDEL-IMPUTE achieves comparable imputation accuracy while reducing run times significantly. Implementation in a lower-level language such as Fortran or C is apt to further improve computational efficiency.


Subjects
Artificial Intelligence , Genotype , Models, Genetic , Software , Algorithms , Computer Simulation , Genome, Human , HapMap Project , Humans , Microarray Analysis , Polymorphism, Single Nucleotide
9.
Bioinformatics ; 31(21): 3549-51, 2015 Nov 01.
Article in English | MEDLINE | ID: mdl-26142186

ABSTRACT

MOTIVATION: The development of Approximate Bayesian Computation (ABC) algorithms for parameter inference which are both computationally efficient and scalable in parallel computing environments is an important area of research. Monte Carlo rejection sampling, a fundamental component of ABC algorithms, is trivial to distribute over multiple processors but is inherently inefficient. While the development of algorithms such as ABC Sequential Monte Carlo (ABC-SMC) helps address the inherent inefficiencies of rejection sampling, such approaches are not as easily scaled on multiple processors. As a result, current Bayesian inference software offerings that use ABC-SMC lack the ability to scale in parallel computing environments. RESULTS: We present al3c, a C++ framework for implementing ABC-SMC in parallel. By requiring only that users define essential functions such as the simulation model and prior distribution function, al3c abstracts the user from both the complexities of parallel programming and the details of the ABC-SMC algorithm. By using the al3c framework, the user is able to scale the ABC-SMC algorithm in parallel computing environments for his or her specific application, with minimal programming overhead. AVAILABILITY AND IMPLEMENTATION: al3c is offered as a static binary for Linux and OS X computing environments. The user completes an XML configuration file and C++ plug-in template for the specific application, which are used by al3c to obtain the desired results. Users can download the static binaries, source code, reference documentation and examples (including those in this article) by visiting https://github.com/ahstram/al3c. CONTACT: astram@usc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Models, Biological , Software , Algorithms , Animals , Bayes Theorem , Monte Carlo Method
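
The Monte Carlo rejection sampling that the abstract above describes as the basic building block of ABC can be sketched as follows. This is plain ABC rejection, not ABC-SMC, and it does not use the al3c API. The toy model (a normal distribution with unknown mean and unit variance), the uniform prior on [-5, 5], the tolerance, and the function name `abc_rejection` are all assumptions made for illustration.

```python
import random


def abc_rejection(observed_mean, n_obs, n_draws, tol, rng):
    """Return prior draws whose simulated sample mean lands within tol of the data."""
    accepted = []
    for _ in range(n_draws):
        theta = rng.uniform(-5.0, 5.0)                       # draw from the prior
        sim = [rng.gauss(theta, 1.0) for _ in range(n_obs)]  # simulate a data set
        if abs(sum(sim) / n_obs - observed_mean) < tol:      # compare summaries
            accepted.append(theta)
    return accepted


rng = random.Random(0)  # fixed seed so the run is repeatable
posterior = abc_rejection(observed_mean=1.5, n_obs=20, n_draws=20000, tol=0.2, rng=rng)
posterior_mean = sum(posterior) / len(posterior)
```

Each draw is independent of every other draw, which is why this scheme distributes trivially over processors, and most draws are rejected, which is the inefficiency that sequential schemes such as ABC-SMC aim to reduce.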
10.
PLoS Comput Biol ; 11(5): e1004228, 2015 May.
Article in English | MEDLINE | ID: mdl-25965340

ABSTRACT

The primary goal in cluster analysis is to discover natural groupings of objects. The field of cluster analysis is crowded with diverse methods that make special assumptions about data and address different scientific aims. Despite its shortcomings in accuracy, hierarchical clustering is the dominant clustering method in bioinformatics. Biologists find the trees constructed by hierarchical clustering visually appealing and in tune with their evolutionary perspective. Hierarchical clustering operates on multiple scales simultaneously. This is essential, for instance, in transcriptome data, where one may be interested in making qualitative inferences about how lower-order relationships like gene modules lead to higher-order relationships like pathways or biological processes. The recently developed method of convex clustering preserves the visual appeal of hierarchical clustering while ameliorating its propensity to make false inferences in the presence of outliers and noise. The solution paths generated by convex clustering reveal relationships between clusters that are hidden by static methods such as k-means clustering. The current paper derives and tests a novel proximal distance algorithm for minimizing the objective function of convex clustering. The algorithm separates parameters, accommodates missing data, and supports prior information on relationships. Our program CONVEXCLUSTER incorporating the algorithm is implemented on ATI and nVidia graphics processing units (GPUs) for maximal speed. Several biological examples illustrate the strengths of convex clustering and the ability of the proximal distance algorithm to handle high-dimensional problems. CONVEXCLUSTER can be freely downloaded from the UCLA Human Genetics web site at http://www.genetics.ucla.edu/software/.


Subjects
Cluster Analysis , Computational Biology/methods , Pattern Recognition, Automated/methods , Algorithms , Databases, Genetic , Gene Expression Profiling/methods , Humans , Software
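
The abstract above refers to the objective function of convex clustering without writing it out. A commonly used form is shown below as a reference point; it is an assumption that the paper uses this form, and the symbols are introduced here for illustration: x_i is the i-th data point, u_i its cluster centroid, w_ij a nonnegative weight, and γ a regularization parameter.

```latex
\min_{u_1,\dots,u_n}\;
  \frac{1}{2}\sum_{i=1}^{n} \lVert x_i - u_i \rVert_2^2
  \;+\; \gamma \sum_{i<j} w_{ij}\, \lVert u_i - u_j \rVert
```

At γ = 0 every point is its own centroid. As γ increases, the penalty pulls centroids together until they coincide, and tracing the minimizers over a range of γ values yields the solution path mentioned in the abstract.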
11.
Hum Mol Genet ; 21(8): 1907-17, 2012 Apr 15.
Article in English | MEDLINE | ID: mdl-22228098

ABSTRACT

Among US Latinas and Mexican women, those with higher European ancestry have increased risk of breast cancer. We combined an admixture mapping and genome-wide association mapping approach to search for genomic regions that may explain this observation. Latina women with breast cancer (n = 1497) and Latina controls (n = 1272) were genotyped using Affymetrix and Illumina arrays. We inferred locus-specific genetic ancestry and compared the ancestry between cases and controls. We also performed single nucleotide polymorphism (SNP) association analyses in regions of interest. Correction for multiple-hypothesis testing was conducted using permutations (P(corrected)). We identified one region where genetic ancestry was significantly associated with breast cancer risk: 6q25 [odds ratio (OR) per Indigenous American chromosome 0.75, 95% confidence interval (CI): 0.65-0.85, P = 1.1 × 10^-5, P(corrected) = 0.02]. A second region on 11p15 showed a trend towards association (OR per Indigenous American chromosome 0.77, 95% CI: 0.68-0.87, P = 4.3 × 10^-5, P(corrected) = 0.08). In both regions, breast cancer risk decreased with higher Indigenous American ancestry in concordance with observations made on global ancestry. The peak of the 6q25 signal includes the estrogen receptor 1 (ESR1) gene and 5' region, a locus previously implicated in breast cancer. Genome-wide association analysis found that a multi-SNP model explained the admixture signal in both regions. Our results confirm that the association between genetic ancestry and breast cancer risk in US Latinas is partly due to genetic differences between populations of European and Indigenous American origin. Fine-mapping within the 6q25 and possibly the 11p15 loci will lead to the discovery of the biologically functional variant(s) behind this association.


Subjects
Breast Neoplasms/genetics , Chromosomes, Human, Pair 6/genetics , Estrogen Receptor alpha/genetics , Genetic Loci , Genetic Predisposition to Disease , Hispanic or Latino/genetics , Breast Neoplasms/classification , Case-Control Studies , Chromosome Mapping , Chromosomes, Human, Pair 11/genetics , Female , Gene Frequency , Genome-Wide Association Study , Genotype , Humans , Microfilament Proteins/genetics , Polymorphism, Single Nucleotide , Risk Factors , White People/genetics
12.
Hum Mol Genet ; 21(24): 5373-84, 2012 Dec 15.
Article in English | MEDLINE | ID: mdl-22976474

ABSTRACT

Genome-wide association studies (GWAS) of breast cancer defined by hormone receptor status have revealed loci contributing to susceptibility of estrogen receptor (ER)-negative subtypes. To identify additional genetic variants for ER-negative breast cancer, we conducted the largest meta-analysis of ER-negative disease to date, comprising 4754 ER-negative cases and 31 663 controls from three GWAS: NCI Breast and Prostate Cancer Cohort Consortium (BPC3) (2188 ER-negative cases; 25 519 controls of European ancestry), Triple Negative Breast Cancer Consortium (TNBCC) (1562 triple negative cases; 3399 controls of European ancestry) and African American Breast Cancer Consortium (AABC) (1004 ER-negative cases; 2745 controls). We performed in silico replication of 86 SNPs at P ≤ 1 × 10^-5 in an additional 11 209 breast cancer cases (946 with ER-negative disease) and 16 057 controls of Japanese, Latino and European ancestry. We identified two novel loci for breast cancer at 20q11 and 6q14. SNP rs2284378 at 20q11 was associated with ER-negative breast cancer (combined two-stage OR = 1.16; P = 1.1 × 10^-8) but showed a weaker association with overall breast cancer (OR = 1.08, P = 1.3 × 10^-6) based on 17 869 cases and 43 745 controls and no association with ER-positive disease (OR = 1.01, P = 0.67) based on 9965 cases and 22 902 controls. Similarly, rs17530068 at 6q14 was associated with breast cancer (OR = 1.12; P = 1.1 × 10^-9), and with both ER-positive (OR = 1.09; P = 1.5 × 10^-5) and ER-negative (OR = 1.16, P = 2.5 × 10^-7) disease. We also confirmed three known loci associated with ER-negative (19p13) and both ER-negative and ER-positive breast cancer (6q25 and 12p11). Our results highlight the value of large-scale collaborative studies to identify novel breast cancer risk loci.


Subjects
Breast Neoplasms/genetics , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study , Female , Humans , Polymorphism, Single Nucleotide/genetics , Receptors, Estrogen/genetics
13.
Bioinformatics ; 29(23): 2964-70, 2013 Dec 01.
Article in English | MEDLINE | ID: mdl-24021380

ABSTRACT

MOTIVATION: The accurate detection of copy number alterations (CNAs) in human genomes is important for understanding susceptibility to cancer and mechanisms of tumor progression. CNA detection in tumors from single nucleotide polymorphism (SNP) genotyping arrays is a challenging problem due to phenomena such as aneuploidy, stromal contamination, genomic waves and intra-tumor heterogeneity, issues that leading methods do not optimally address. RESULTS: Here we introduce methods and software (PennCNV-tumor) for fast and accurate CNA detection using signal intensity data from SNP genotyping arrays. We estimate stromal contamination by applying a maximum likelihood approach over multiple discrete genomic intervals. By conditioning on signal intensity across the genome, our method accounts for both aneuploidy and genomic waves. Finally, our method uses a hidden Markov model to integrate multiple sources of information, including total and allele-specific signal intensity at each SNP, as well as physical maps to make posterior inferences of CNAs. Using real data from cancer cell-lines and patient tumors, we demonstrate substantial improvements in accuracy and computational efficiency compared with existing methods.


Subjects
Breast Neoplasms/genetics , Computational Biology , DNA Copy Number Variations/genetics , Genome, Human , Polymorphism, Single Nucleotide/genetics , Aneuploidy , Breast Neoplasms/pathology , Cell Line, Tumor , Chromosome Aberrations , Female , Genomics , Genotype , Humans , Likelihood Functions , Markov Chains , Oligonucleotide Array Sequence Analysis , Software , Stromal Cells/metabolism , Stromal Cells/pathology
14.
Bioinformatics ; 29(11): 1407-15, 2013 Jun 01.
Article in English | MEDLINE | ID: mdl-23572411

ABSTRACT

MOTIVATION: Local ancestry analysis of genotype data from recently admixed populations (e.g. Latinos, African Americans) provides key insights into population history and disease genetics. Although methods for local ancestry inference have been extensively validated in simulations (under many unrealistic assumptions), no empirical study of local ancestry accuracy in Latinos exists to date. Hence, interpreting findings that rely on local ancestry in Latinos is challenging. RESULTS: Here, we use 489 nuclear families from the mainland USA, Puerto Rico and Mexico in conjunction with 3204 unrelated Latinos from the Multiethnic Cohort study to provide the first empirical characterization of local ancestry inference accuracy in Latinos. Our approach for identifying errors does not rely on simulations but on the observation that local ancestry in families follows Mendelian inheritance. We measure the rate of local ancestry assignments that lead to Mendelian inconsistencies in local ancestry in trios (MILANC), which provides a lower bound on errors in the local ancestry estimates. We show that MILANC rates observed in simulations underestimate the rate observed in real data, and that MILANC varies substantially across the genome. Second, across a wide range of methods, we observe that loci with large deviations in local ancestry also show enrichment in MILANC rates. Therefore, local ancestry estimates at such loci should be interpreted with caution. Finally, we reconstruct ancestral haplotype panels to be used as reference panels in local ancestry inference and show that ancestry inference is significantly improved by incorporating these reference panels. AVAILABILITY AND IMPLEMENTATION: We provide the reconstructed reference panels together with the maps of MILANC rates as a public resource for researchers analyzing local ancestry in Latinos at http://bogdanlab.pathology.ucla.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Hispanic or Latino/genetics , Bias , Cohort Studies , Family , Genetic Loci , Genetics, Population/methods , Genome, Human , Genome-Wide Association Study , Genotype , Haplotypes , Humans , Mexican Americans , Puerto Rico/ethnology , United States/ethnology
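
The Mendelian-consistency idea behind MILANC in the abstract above can be illustrated with a small check on trio calls. The sketch assumes a simplified encoding that the abstract does not specify: at each locus, every individual carries a dosage of 0, 1 or 2 copies of a single ancestry of interest. The names `TRANSMITTABLE`, `is_mendelian_consistent` and `inconsistency_rate`, and the example calls, are hypothetical.

```python
# Copies of the ancestry that a parent with dosage d can pass on one haplotype.
TRANSMITTABLE = {0: {0}, 1: {0, 1}, 2: {1}}


def is_mendelian_consistent(father, mother, child):
    """True if the child's dosage can arise from one haplotype per parent."""
    return any(f + m == child
               for f in TRANSMITTABLE[father]
               for m in TRANSMITTABLE[mother])


def inconsistency_rate(trio_calls):
    """Fraction of (father, mother, child) calls that violate Mendelian inheritance."""
    bad = sum(1 for f, m, c in trio_calls if not is_mendelian_consistent(f, m, c))
    return bad / len(trio_calls)


# Hypothetical local ancestry calls for one trio at four loci.
calls = [(1, 1, 2), (0, 2, 1), (2, 2, 0), (0, 0, 1)]
rate = inconsistency_rate(calls)
```

A rate computed this way only counts errors that happen to break inheritance rules; calls that are wrong but still consistent go undetected, which is why the abstract describes MILANC as a lower bound on the error rate.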
15.
PLoS Genet ; 7(5): e1001387, 2011 May.
Article in English | MEDLINE | ID: mdl-21637779

ABSTRACT

GWAS of prostate cancer have been remarkably successful in revealing common genetic variants and novel biological pathways that are linked with its etiology. A more complete understanding of inherited susceptibility to prostate cancer in the general population will come from continuing such discovery efforts and from testing known risk alleles in diverse racial and ethnic groups. In this large study of prostate cancer in African American men (3,425 prostate cancer cases and 3,290 controls), we tested 49 risk variants located in 28 genomic regions identified through GWAS in men of European and Asian descent, and we replicated associations (at p≤0.05) with roughly half of these markers. Through fine-mapping, we identified nearby markers in many regions that better define associations in African Americans. At 8q24, we found 9 variants (p≤6×10^-4) that best capture risk of prostate cancer in African Americans, many of which are more common in men of African than European descent. The markers found to be associated with risk at each locus improved risk modeling in African Americans (per allele OR = 1.17) over the alleles reported in the original GWAS (OR = 1.08). In summary, in this detailed analysis of the prostate cancer risk loci reported from GWAS, we have validated and improved upon markers of risk in some regions that better define the association with prostate cancer in African Americans. Our findings with variants at 8q24 also reinforce the importance of this region as a major risk locus for prostate cancer in men of African ancestry.


Subjects
Black or African American/genetics , Genetic Loci , Genome-Wide Association Study , Prostatic Neoplasms/genetics , Adult , Aged , Aged, 80 and over , Black People/genetics , Case-Control Studies , Chromosomes, Human, Pair 8/genetics , Cohort Studies , Gene Frequency , Genotype , Humans , Linkage Disequilibrium , Male , Middle Aged , Polymorphism, Single Nucleotide , Prostatic Neoplasms/ethnology , White People/genetics , Young Adult
16.
PLoS Genet ; 7(4): e1001371, 2011 Apr.
Article in English | MEDLINE | ID: mdl-21541012

ABSTRACT

While genome-wide association studies (GWAS) have primarily examined populations of European ancestry, more recent studies often involve additional populations, including admixed populations such as African Americans and Latinos. In admixed populations, linkage disequilibrium (LD) exists both at a fine scale in ancestral populations and at a coarse scale (admixture-LD) due to chromosomal segments of distinct ancestry. Disease association statistics in admixed populations have previously considered SNP association (LD mapping) or admixture association (mapping by admixture-LD), but not both. Here, we introduce a new statistical framework for combining SNP and admixture association in case-control studies, as well as methods for local ancestry-aware imputation. We illustrate the gain in statistical power achieved by these methods by analyzing data of 6,209 unrelated African Americans from the CARe project genotyped on the Affymetrix 6.0 chip, in conjunction with both simulated and real phenotypes, as well as by analyzing the FGFR2 locus using breast cancer GWAS data from 5,761 African-American women. We show that, at typed SNPs, our method yields an 8% increase in statistical power for finding disease risk loci compared to the power achieved by standard methods in case-control studies. At imputed SNPs, we observe an 11% increase in statistical power for mapping disease loci when our local ancestry-aware imputation framework and the new scoring statistic are jointly employed. Finally, we show that our method increases statistical power in regions harboring the causal SNP in the case when the causal SNP is untyped and cannot be imputed. Our methods and our publicly available software are broadly applicable to GWAS in admixed populations.


Subjects
Black or African American/genetics , Breast Neoplasms/genetics , Genome, Human , Genome-Wide Association Study/methods , Receptor, Fibroblast Growth Factor, Type 2/genetics , Black or African American/statistics & numerical data , Algorithms , Chromosome Mapping , Coronary Disease/genetics , Diabetes Mellitus, Type 2/genetics , Female , Gene Frequency , Genetic Variation , Genetics, Population/statistics & numerical data , Genome-Wide Association Study/statistics & numerical data , Genotype , Humans , Linkage Disequilibrium , Male , Odds Ratio , Phenotype , Polymorphism, Single Nucleotide , Principal Component Analysis , Software
17.
PLoS Genet ; 7(10): e1002298, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-21998595

RESUMO

Adult height is a classic polygenic trait of high heritability (h(2) approximately 0.8). More than 180 single nucleotide polymorphisms (SNPs), identified mostly in populations of European descent, are associated with height. These variants convey modest effects and explain approximately10% of the variance in height. Discovery efforts in other populations, while limited, have revealed loci for height not previously implicated in individuals of European ancestry. Here, we performed a meta-analysis of genome-wide association (GWA) results for adult height in 20,427 individuals of African ancestry with replication in up to 16,436 African Americans. We found two novel height loci (Xp22-rs12393627, P = 3.4×10(-12) and 2p14-rs4315565, P = 1.2×10(-8)). As a group, height associations discovered in European-ancestry samples replicate in individuals of African ancestry (P = 1.7×10(-4) for overall replication). Fine-mapping of the European height loci in African-ancestry individuals showed an enrichment of SNPs that are associated with expression of nearby genes when compared to the index European height SNPs (P<0.01). Our results highlight the utility of genetic studies in non-European populations to understand the etiology of complex human diseases and traits.


Subjects
Black or African American/genetics , Body Height/genetics , Adult , Aged , Aged, 80 and over , Chromosome Mapping , Female , Gene Frequency , Genome-Wide Association Study , Genotype , Humans , Male , Middle Aged , Phenotype , Polymorphism, Single Nucleotide , White People/genetics
18.
Hum Mol Genet ; 20(22): 4491-503, 2011 Nov 15.
Article in English | MEDLINE | ID: mdl-21852243

ABSTRACT

Genome-wide association studies (GWAS) have revealed 19 common genetic variants that are associated with breast cancer risk. Testing of the index signals found through GWAS and fine-mapping of each locus in diverse populations will be necessary for characterizing the role of these risk regions in contributing to inherited susceptibility. In this large study of breast cancer in African-American women (3016 cases and 2745 controls), we tested the 19 known risk variants identified by GWAS and replicated associations (P < 0.05) with only 4 variants. Through fine-mapping, we identified markers in four regions that better capture the association with breast cancer risk in African Americans as defined by the index signal (2q35, 5q11, 10q26 and 19p13). We also identified statistically significant associations with markers in four separate regions (8q24, 10q22, 11q13 and 16q12) that are independent of the index signals and may represent putative novel risk variants. In aggregate, the more informative markers found in the study enhance the association of these risk regions with breast cancer in African Americans [per allele odds ratio (OR) = 1.18, P = 2.8 × 10⁻²⁴ versus OR = 1.04, P = 6.1 × 10⁻⁵]. In this detailed analysis of the known breast cancer risk loci, we have validated and improved upon markers of risk that better characterize their association with breast cancer in women of African ancestry.


Subjects
Breast Neoplasms/genetics , Adult , Black or African American/genetics , Aged , Aged, 80 and over , Chromosomes, Human, Pair 10/genetics , Chromosomes, Human, Pair 11/genetics , Chromosomes, Human, Pair 16/genetics , Chromosomes, Human, Pair 8/genetics , Female , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study , Humans , Middle Aged , Odds Ratio , Young Adult
19.
Hum Genet ; 132(1): 39-48, 2013 Jan.
Article in English | MEDLINE | ID: mdl-22923054

ABSTRACT

Genome-wide association studies (GWAS) in diverse populations are needed to reveal variants that are more common and/or limited to defined populations. We conducted a GWAS of breast cancer in women of African ancestry, with genotyping of >1,000,000 SNPs in 3,153 African American cases and 2,831 controls, and replication testing of the top 66 associations in an additional 3,607 breast cancer cases and 11,330 controls of African ancestry. Two of the 66 SNPs replicated (p < 0.05) in stage 2, which reached statistical significance levels of 10⁻⁶ and 10⁻⁵ in the stage 1 and 2 combined analysis (rs4322600 at chromosome 14q31: OR = 1.18, p = 4.3 × 10⁻⁶; rs10510333 at chromosome 3p26: OR = 1.15, p = 1.5 × 10⁻⁵). These suggestive risk loci have not been identified in previous GWAS in other populations and will need to be examined in additional samples. Identification of novel risk variants for breast cancer in women of African ancestry will demand testing of a substantially larger set of markers from stage 1 in a larger replication sample.


Subjects
Black People/genetics , Black or African American/genetics , Breast Neoplasms/genetics , Polymorphism, Single Nucleotide , Adult , Aged , Aged, 80 and over , Case-Control Studies , Cohort Studies , Female , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Middle Aged , Risk Factors , Young Adult
20.
Bioinformatics ; 28(5): 719-20, 2012 Mar 01.
Article in English | MEDLINE | ID: mdl-22238272

ABSTRACT

The deluge of data emerging from high-throughput sequencing technologies poses large analytical challenges when testing for association to disease. We introduce a scalable framework for variable selection, implemented in C++ and OpenCL, that fits regularized regression across multiple Graphics Processing Units. Open source code and documentation can be found at a Google Code repository under the URL http://bioinformatics.oxfordjournals.org/content/early/2012/01/10/bioinformatics.bts015.abstract. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
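
To make "variable selection by regularized regression" concrete, here is a minimal single-threaded sketch. It is not the authors' C++/OpenCL implementation. The abstract does not name the penalty, so the L1 (lasso) penalty, the cyclic coordinate-descent solver, and the function names `soft_threshold` and `lasso_coordinate_descent` are all assumptions made for illustration.

```python
import numpy as np


def soft_threshold(z, t):
    """Shrink z toward zero by t (proximal operator of the L1 penalty)."""
    return np.sign(z) * max(abs(z) - t, 0.0)


def lasso_coordinate_descent(X, y, lam, n_sweeps=200):
    """Minimize (1/(2n)) * ||y - X b||^2 + lam * ||b||_1 over b.

    Illustrative assumption: an L1 penalty solved by cyclic coordinate
    descent. Predictors whose coefficient stays at zero are "not selected".
    """
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    beta = np.zeros(p)
    residual = np.asarray(y, dtype=float).copy()  # y - X @ beta, with beta = 0
    col_sq = (X ** 2).sum(axis=0) / n             # per-column curvature

    for _ in range(n_sweeps):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # constant-zero column carries no information
            # Covariance of column j with the residual that excludes j's own fit.
            rho = X[:, j] @ residual / n + col_sq[j] * beta[j]
            new_bj = soft_threshold(rho, lam) / col_sq[j]
            if new_bj != beta[j]:
                # Keep the residual in sync with the updated coefficient.
                residual -= X[:, j] * (new_bj - beta[j])
                beta[j] = new_bj
    return beta
```

With a centered genotype matrix `X` (subjects by SNPs) and a centered trait vector `y`, the nonzero entries of the returned vector mark the selected SNPs. Raising `lam` selects fewer of them, and any `lam` above `max(|X.T @ y|) / n` selects none.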


Subjects
Algorithms , Genome-Wide Association Study , Software , Humans , Male , Polymorphism, Single Nucleotide , Programming Languages , Prostatic Neoplasms/ethnology , Prostatic Neoplasms/genetics