Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Genet Epidemiol ; 47(3): 231-248, 2023 04.
Artículo en Inglés | MEDLINE | ID: mdl-36739617

RESUMEN

Linkage analysis, a class of methods for detecting co-segregation of genomic segments and traits in families, was used to map disease-causing genes for decades before genotyping arrays and dense SNP genotyping enabled genome-wide association studies in population samples. Population samples often contain related individuals, but the segregation of alleles within families is rarely used because traditional linkage methods are computationally inefficient for larger datasets. Here, we describe Population Linkage, a novel application of Haseman-Elston regression as a method of moments estimator of variance components and their standard errors. We achieve additional computational efficiency by using modern methods for detection of IBD segments and variance component estimation, efficient preprocessing of input data, and minimizing redundant numerical calculations. We also refined variance component models to account for the biases in population-scale methods for IBD segment detection. We ran Population Linkage on four blood lipid traits in over 70,000 individuals from the HUNT and SardiNIA studies, successfully detecting 25 known genetic signals. One notable linkage signal that appeared in both was for low-density lipoprotein (LDL) cholesterol levels in the region near the gene APOE (LOD = 29.3, variance explained = 4.1%). This is the region where the missense variants rs7412 and rs429358, which together make up the ε2, ε3, and ε4 alleles each account for 2.4% and 0.8% of variation in circulating LDL cholesterol. Our results show the potential for linkage analysis and other large-scale applications of method of moments variance components estimation.


Asunto(s)
Estudio de Asociación del Genoma Completo , Modelos Genéticos , Humanos , Fenotipo , LDL-Colesterol/genética , Ligamiento Genético , Apolipoproteínas E/genética
2.
Am J Hum Genet ; 105(1): 65-77, 2019 07 03.
Artículo en Inglés | MEDLINE | ID: mdl-31204010

RESUMEN

The Genes for Good study uses social media to engage a large, diverse participant pool in genetics research and education. Health history and daily tracking surveys are administered through a Facebook application, and participants who complete a minimum number of surveys are mailed a saliva sample kit ("spit kit") to collect DNA for genotyping. As of March 2019, we engaged >80,000 individuals, sent spit kits to >32,000 individuals who met minimum participation requirements, and collected >27,000 spit kits. Participants come from all 50 states and include a diversity of ancestral backgrounds. Rates of important chronic health indicators are consistent with those estimated for the general U.S. population using more traditional study designs. However, our sample is younger and contains a greater percentage of females than the general population. As one means of verifying data quality, we have replicated genome-wide association studies (GWASs) for exemplar traits, such as asthma, diabetes, body mass index (BMI), and pigmentation. The flexible framework of the web application makes it relatively simple to add new questionnaires and for other researchers to collaborate. We anticipate that the study sample will continue to grow and that future analyses may further capitalize on the strengths of the longitudinal data in combination with genetic information.


Asunto(s)
Genes/genética , Marcadores Genéticos , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Proyectos de Investigación , Medios de Comunicación Sociales , Adolescente , Adulto , Diabetes Mellitus/diagnóstico , Diabetes Mellitus/genética , Femenino , Humanos , Hipertensión/diagnóstico , Hipertensión/genética , Masculino , Persona de Mediana Edad , Salud Pública , Encuestas y Cuestionarios , Adulto Joven
3.
PLoS Genet ; 14(7): e1007452, 2018 07.
Artículo en Inglés | MEDLINE | ID: mdl-30016313

RESUMEN

Meta-analysis of genetic association studies increases sample size and the power for mapping complex traits. Existing methods are mostly developed for datasets without missing values, i.e. the summary association statistics are measured for all variants in contributing studies. In practice, genotype imputation is not always effective. This may be the case when targeted genotyping/sequencing assays are used or when the un-typed genetic variant is rare. Therefore, contributed summary statistics often contain missing values. Existing methods for imputing missing summary association statistics and using imputed values in meta-analysis, approximate conditional analysis, or simple strategies such as complete case analysis all have theoretical limitations. Applying these approaches can bias genetic effect estimates and lead to seriously inflated type-I or type-II errors in conditional analysis, which is a critical tool for identifying independently associated variants. To address this challenge and complement imputation methods, we developed a method to combine summary statistics across participating studies and consistently estimate joint effects, even when the contributed summary statistics contain large amounts of missing values. Based on this estimator, we proposed a score statistic called PCBS (partial correlation based score statistic) for conditional analysis of single-variant and gene-level associations. Through extensive analysis of simulated and real data, we showed that the new method produces well-calibrated type-I errors and is substantially more powerful than existing approaches. We applied the proposed approach to one of the largest meta-analyses to date for the cigarettes-per-day phenotype. Using the new method, we identified multiple novel independently associated variants at known loci for tobacco use, which were otherwise missed by alternative methods. Together, the phenotypic variance explained by these variants was 1.1%, improving that of previously reported associations by 71%. These findings illustrate the extent of locus allelic heterogeneity and can help pinpoint causal variants.


Asunto(s)
Análisis de Datos , Productos de Tabaco/estadística & datos numéricos , Uso de Tabaco/genética , Alelos , Interpretación Estadística de Datos , Conjuntos de Datos como Asunto , Sitios Genéticos/genética , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Fenotipo , Polimorfismo de Nucleótido Simple
4.
Genet Epidemiol ; 43(8): 980-995, 2019 12.
Artículo en Inglés | MEDLINE | ID: mdl-31452258

RESUMEN

Array genotyping is a cost-effective and widely used tool that enables assessment of up to millions of genetic markers in hundreds of thousands of individuals. Genotyping array data are typically highly accurate but sensitive to mixing of DNA samples from multiple individuals before or during genotyping. Contaminated samples can lead to genotyping errors and consequently cause false positive signals or reduce power of association analyses. Here, we propose a new method to identify contaminated samples and the sources of contamination within a genotyping batch. Through analysis of array intensity and genotype data from intentionally mixed samples and 22,366 samples of the Michigan Genomics Initiative, an ongoing biobank-based study, we show that our method can reliably estimate contamination. We also show that identifying sources of contamination can implicate problematic sample processing steps and guide process improvements. Compared to existing methods, our approach can estimate the proportion of contaminating DNA more accurately, eliminate the need for external databases of allele frequencies, and provide contamination estimates that are more robust to the ancestral origin of the contaminating sample.


Asunto(s)
Contaminación de ADN , Técnicas de Genotipaje , ADN , Frecuencia de los Genes , Marcadores Genéticos , Genómica/métodos , Genotipo , Técnicas de Genotipaje/métodos , Humanos , Polimorfismo de Nucleótido Simple
6.
Cell Metab ; 33(2): 350-366.e7, 2021 02 02.
Artículo en Inglés | MEDLINE | ID: mdl-33186557

RESUMEN

Efficient delivery of specific cargos in vivo poses a major challenge to the secretory pathway, which shuttles products encoded by ∼30% of the genome. Newly synthesized protein and lipid cargos embark on the secretory pathway via COPII-coated vesicles, assembled by the GTPase SAR1 on the endoplasmic reticulum (ER), but how lipid-carrying lipoproteins are distinguished from the general protein cargos in the ER and selectively secreted has not been clear. Here, we show that this process is quantitatively governed by the GTPase SAR1B and SURF4, a high-efficiency cargo receptor. While both genes are implicated in lipid regulation in humans, hepatic inactivation of either mouse Sar1b or Surf4 selectively depletes plasma lipids to near-zero and protects the mice from atherosclerosis. These findings show that the pairing between SURF4 and SAR1B synergistically operates a specialized, dosage-sensitive transport program for circulating lipids, while further suggesting a potential translation to treat atherosclerosis and related cardio-metabolic diseases.


Asunto(s)
Retículo Endoplásmico/metabolismo , Lipoproteínas/metabolismo , Proteínas de la Membrana/metabolismo , Proteínas de Unión al GTP Monoméricas/metabolismo , Animales , Células Cultivadas , Homeostasis , Humanos , Lípidos/sangre , Lípidos/química , Masculino , Ratones , Ratones Endogámicos C57BL , Ratones Transgénicos
7.
Genes (Basel) ; 11(5)2020 05 25.
Artículo en Inglés | MEDLINE | ID: mdl-32466134

RESUMEN

There is great interest in understanding the impact of rare variants in human diseases using large sequence datasets. In deep sequence datasets of >10,000 samples, ~10% of the variant sites are observed to be multi-allelic. Many of the multi-allelic variants have been shown to be functional and disease-relevant. Proper analysis of multi-allelic variants is critical to the success of a sequencing study, but existing methods do not properly handle multi-allelic variants and can produce highly misleading association results. We discuss practical issues and methods to encode multi-allelic sites, conduct single-variant and gene-level association analyses, and perform meta-analysis for multi-allelic variants. We evaluated these methods through extensive simulations and the study of a large meta-analysis of ~18,000 samples on the cigarettes-per-day phenotype. We showed that our joint modeling approach provided an unbiased estimate of genetic effects, greatly improved the power of single-variant association tests among methods that can properly estimate allele effects, and enhanced gene-level tests over existing approaches. Software packages implementing these methods are available online.


Asunto(s)
Fumar Cigarrillos/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo/estadística & datos numéricos , Enfermedades Raras/genética , Alelos , Interpretación Estadística de Datos , Femenino , Variación Genética/genética , Humanos , Masculino , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Enfermedades Raras/epidemiología , Enfermedades Raras/patología
8.
Cell Metab ; 31(4): 741-754.e5, 2020 04 07.
Artículo en Inglés | MEDLINE | ID: mdl-32197071

RESUMEN

Identifying the causal gene(s) that connects genetic variation to a phenotype is a challenging problem in genome-wide association studies (GWASs). Here, we develop a systematic approach that integrates mouse liver co-expression networks with human lipid GWAS data to identify regulators of cholesterol and lipid metabolism. Through our approach, we identified 48 genes showing replication in mice and associated with plasma lipid traits in humans and six genes on the X chromosome. Among these 54 genes, 25 have no previously identified role in lipid metabolism. Based on functional studies and integration with additional human lipid GWAS datasets, we pinpoint Sestrin1 as a causal gene associated with plasma cholesterol levels in humans. Our validation studies demonstrate that Sestrin1 influences plasma cholesterol in multiple mouse models and regulates cholesterol biosynthesis. Our results highlight the power of combining mouse and human datasets for prioritization of human lipid GWAS loci and discovery of lipid genes.


Asunto(s)
Colesterol , Estudio de Asociación del Genoma Completo/métodos , Proteínas de Choque Térmico/fisiología , Animales , Colesterol/sangre , Colesterol/metabolismo , Bases de Datos Genéticas , Humanos , Ratones
9.
Nat Commun ; 11(1): 6417, 2020 12 18.
Artículo en Inglés | MEDLINE | ID: mdl-33339817

RESUMEN

Pharmaceutical drugs targeting dyslipidemia and cardiovascular disease (CVD) may increase the risk of fatty liver disease and other metabolic disorders. To identify potential novel CVD drug targets without these adverse effects, we perform genome-wide analyses of participants in the HUNT Study in Norway (n = 69,479) to search for protein-altering variants with beneficial impact on quantitative blood traits related to cardiovascular disease, but without detrimental impact on liver function. We identify 76 (11 previously unreported) presumed causal protein-altering variants associated with one or more CVD- or liver-related blood traits. Nine of the variants are predicted to result in loss-of-function of the protein. This includes ZNF529:p.K405X, which is associated with decreased low-density-lipoprotein (LDL) cholesterol (P = 1.3 × 10-8) without being associated with liver enzymes or non-fasting blood glucose. Silencing of ZNF529 in human hepatoma cells results in upregulation of LDL receptor and increased LDL uptake in the cells. This suggests that inhibition of ZNF529 or its gene product should be prioritized as a novel candidate drug target for treating dyslipidemia and associated CVD.


Asunto(s)
Enfermedades Cardiovasculares/genética , Genoma Humano , Mutación con Pérdida de Función/genética , Terapia Molecular Dirigida , Bancos de Muestras Biológicas , Enfermedades Cardiovasculares/sangre , Silenciador del Gen , Marcación de Gen , Estudio de Asociación del Genoma Completo , Humanos , Lípidos/sangre , Hígado/metabolismo , Fenómica , Receptores de LDL/genética , Reino Unido
10.
Nat Genet ; 51(2): 237-244, 2019 02.
Artículo en Inglés | MEDLINE | ID: mdl-30643251

RESUMEN

Tobacco and alcohol use are leading causes of mortality that influence risk for many complex diseases and disorders1. They are heritable2,3 and etiologically related4,5 behaviors that have been resistant to gene discovery efforts6-11. In sample sizes up to 1.2 million individuals, we discovered 566 genetic variants in 406 loci associated with multiple stages of tobacco use (initiation, cessation, and heaviness) as well as alcohol use, with 150 loci evidencing pleiotropic association. Smoking phenotypes were positively genetically correlated with many health conditions, whereas alcohol use was negatively correlated with these conditions, such that increased genetic risk for alcohol use is associated with lower disease risk. We report evidence for the involvement of many systems in tobacco and alcohol use, including genes involved in nicotinic, dopaminergic, and glutamatergic neurotransmission. The results provide a solid starting point to evaluate the effects of these loci in model organisms and more precise substance use measures.


Asunto(s)
Consumo de Bebidas Alcohólicas/genética , Fumar/genética , Tabaquismo/genética , Femenino , Variación Genética/genética , Estudio de Asociación del Genoma Completo/métodos , Humanos , Masculino , Persona de Mediana Edad , Fenotipo , Riesgo , Nicotiana/efectos adversos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA