Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 106
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Cell ; 186(19): 4085-4099.e15, 2023 09 14.
Artigo em Inglês | MEDLINE | ID: mdl-37714134

RESUMO

Many sequence variants have additive effects on blood lipid levels and, through that, on the risk of coronary artery disease (CAD). We show that variants also have non-additive effects and interact to affect lipid levels as well as affecting variance and correlations. Variance and correlation effects are often signatures of epistasis or gene-environmental interactions. These complex effects can translate into CAD risk. For example, Trp154Ter in FUT2 protects against CAD among subjects with the A1 blood group, whereas it associates with greater risk of CAD in others. His48Arg in ADH1B interacts with alcohol consumption to affect lipid levels and CAD. The effect of variants in TM6SF2 on blood lipids is greatest among those who never eat oily fish but absent from those who often do. This work demonstrates that variants that affect variance of quantitative traits can allow for the discovery of epistasis and interactions of variants with the environment.


Assuntos
Doença da Artéria Coronariana , Animais , Humanos , Doença da Artéria Coronariana/sangue , Doença da Artéria Coronariana/genética , Epistasia Genética , Fenótipo , Lipídeos/sangue , Sistema ABO de Grupos Sanguíneos
2.
Nature ; 622(7982): 348-358, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37794188

RESUMO

High-throughput proteomics platforms measuring thousands of proteins in plasma combined with genomic and phenotypic information have the power to bridge the gap between the genome and diseases. Here we performed association studies of Olink Explore 3072 data generated by the UK Biobank Pharma Proteomics Project1 on plasma samples from more than 50,000 UK Biobank participants with phenotypic and genotypic data, stratifying on British or Irish, African and South Asian ancestries. We compared the results with those of a SomaScan v4 study on plasma from 36,000 Icelandic people2, for 1,514 of whom Olink data were also available. We found modest correlation between the two platforms. Although cis protein quantitative trait loci were detected for a similar absolute number of assays on the two platforms (2,101 on Olink versus 2,120 on SomaScan), the proportion of assays with such supporting evidence for assay performance was higher on the Olink platform (72% versus 43%). A considerable number of proteins had genomic associations that differed between the platforms. We provide examples where differences between platforms may influence conclusions drawn from the integration of protein levels with the study of diseases. We demonstrate how leveraging the diverse ancestries of participants in the UK Biobank helps to detect novel associations and refine genomic location. Our results show the value of the information provided by the two most commonly used high-throughput proteomics platforms and demonstrate the differences between them that at times provides useful complementarity.


Assuntos
Proteínas Sanguíneas , Suscetibilidade a Doenças , Genômica , Genótipo , Fenótipo , Proteômica , Humanos , África/etnologia , Ásia Meridional/etnologia , Bancos de Espécimes Biológicos , Proteínas Sanguíneas/análise , Proteínas Sanguíneas/genética , Conjuntos de Dados como Assunto , Genoma Humano/genética , Islândia/etnologia , Irlanda/etnologia , Plasma/química , Proteoma/análise , Proteoma/genética , Proteômica/métodos , Locos de Características Quantitativas , Reino Unido
3.
Nature ; 607(7920): 732-740, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35859178

RESUMO

Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data1,2. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank3. This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.


Assuntos
Bancos de Espécimes Biológicos , Bases de Dados Genéticas , Variação Genética , Genoma Humano , Genômica , Sequenciamento Completo do Genoma , África/etnologia , Ásia/etnologia , Estudos de Coortes , Sequência Conservada , Éxons/genética , Genoma Humano/genética , Haplótipos/genética , Humanos , Mutação INDEL , Irlanda/etnologia , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único/genética , Reino Unido
4.
N Engl J Med ; 389(19): 1741-1752, 2023 Nov 09.
Artigo em Inglês | MEDLINE | ID: mdl-37937776

RESUMO

BACKGROUND: In 2021, the American College of Medical Genetics and Genomics (ACMG) recommended reporting actionable genotypes in 73 genes associated with diseases for which preventive or therapeutic measures are available. Evaluations of the association of actionable genotypes in these genes with life span are currently lacking. METHODS: We assessed the prevalence of coding and splice variants in genes on the ACMG Secondary Findings, version 3.0 (ACMG SF v3.0), list in the genomes of 57,933 Icelanders. We assigned pathogenicity to all reviewed variants using reported evidence in the ClinVar database, the frequency of variants, and their associations with disease to create a manually curated set of actionable genotypes (variants). We assessed the relationship between these genotypes and life span and further examined the specific causes of death among carriers. RESULTS: Through manual curation of 4405 sequence variants in the ACMG SF v3.0 genes, we identified 235 actionable genotypes in 53 genes. Of the 57,933 participants, 2306 (4.0%) carried at least one actionable genotype. We found shorter median survival among persons carrying actionable genotypes than among noncarriers. Specifically, we found that carrying an actionable genotype in a cancer gene was associated with survival that was 3 years shorter than that among noncarriers, with causes of death among carriers attributed primarily to cancer-related conditions. Furthermore, we found evidence of association between carrying an actionable genotype in certain genes in the cardiovascular disease group and a reduced life span. CONCLUSIONS: On the basis of the ACMG SF v3.0 guidelines, we found that approximately 1 in 25 Icelanders carried an actionable genotype and that carrying such a genotype was associated with a reduced life span. (Funded by deCODE Genetics-Amgen.).


Assuntos
Doença , Genômica , Longevidade , Humanos , Alelos , Testes Genéticos , Variação Genética , Genótipo , Islândia/epidemiologia , Longevidade/genética , Doença/genética , Doenças Cardiovasculares/genética , Neoplasias/genética
5.
Nature ; 582(7810): 78-83, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32494067

RESUMO

Human evolutionary history is rich with the interbreeding of divergent populations. Most humans outside of Africa trace about 2% of their genomes to admixture from Neanderthals, which occurred 50-60 thousand years ago1. Here we examine the effect of this event using 14.4 million putative archaic chromosome fragments that were detected in fully phased whole-genome sequences from 27,566 Icelanders, corresponding to a range of 56,388-112,709 unique archaic fragments that cover 38.0-48.2% of the callable genome. On the basis of the similarity with known archaic genomes, we assign 84.5% of fragments to an Altai or Vindija Neanderthal origin and 3.3% to Denisovan origin; 12.2% of fragments are of unknown origin. We find that Icelanders have more Denisovan-like fragments than expected through incomplete lineage sorting. This is best explained by Denisovan gene flow, either into ancestors of the introgressing Neanderthals or directly into humans. A within-individual, paired comparison of archaic fragments with syntenic non-archaic fragments revealed that, although the overall rate of mutation was similar in humans and Neanderthals during the 500 thousand years that their lineages were separate, there were differences in the relative frequencies of mutation types-perhaps due to different generation intervals for males and females. Finally, we assessed 271 phenotypes, report 5 associations driven by variants in archaic fragments and show that the majority of previously reported associations are better explained by non-archaic variants.


Assuntos
Introgressão Genética/genética , Genoma Humano/genética , Genômica , Mutação , Homem de Neandertal/genética , Animais , Feminino , Estudos de Associação Genética , Haploidia , Humanos , Islândia , Masculino , Fenótipo , Filogenia
7.
Bioinformatics ; 39(8)2023 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-37535674

RESUMO

MOTIVATION: Meiotic recombination is the main driving force of human genetic diversity, along with mutations. Recombinations split into crossovers, separating large chromosomal regions originating from different homologous chromosomes, and non-crossovers (NCOs), where a small segment from one chromosome is embedded in a region originating from the homologous chromosome. NCOs are much less studied than mutations and crossovers as NCOs are short and can only be detected at markers heterozygous in the transmitting parent, leaving most of them undetectable. RESULTS: The detectable NCOs, known as gene conversions, hide information about NCOs, including their number and length, waiting to be unveiled. We introduce NCOurd, software, and algorithm, based on an expectation-maximization algorithm, to estimate the number of NCOs and their length distribution from gene conversion data. AVAILABILITY AND IMPLEMENTATION: https://github.com/DecodeGenetics/NCOurd.


Assuntos
Troca Genética , Conversão Gênica , Humanos , Heterozigoto , Meiose
8.
Bioinformatics ; 38(3): 604-611, 2022 01 12.
Artigo em Inglês | MEDLINE | ID: mdl-34726732

RESUMO

MOTIVATION: With the increasing throughput of sequencing technologies, structural variant (SV) detection has become possible across tens of thousands of genomes. Non-reference sequence (NRS) variants have drawn less attention compared with other types of SVs due to the computational complexity of detecting them. When using short-read data, the detection of NRS variants inevitably involves a de novo assembly which requires high-quality sequence data at high coverage. Previous studies have demonstrated how sequence data of multiple genomes can be combined for the reliable detection of NRS variants. However, the algorithms proposed in these studies have limited scalability to larger sets of genomes. RESULTS: We introduce PopIns2, a tool to discover and characterize NRS variants in many genomes, which scales to considerably larger numbers of genomes than its predecessor PopIns. In this article, we briefly outline the PopIns2 workflow and highlight our novel algorithmic contributions. We developed an entirely new approach for merging contig assemblies of unaligned reads from many genomes into a single set of NRS using a colored de Bruijn graph. Our tests on simulated data indicate that the new merging algorithm ranks among the best approaches in terms of quality and reliability and that PopIns2 shows the best precision for a growing number of genomes processed. Results on the Polaris Diversity Cohort and a set of 1000 Icelandic human genomes demonstrate unmatched scalability for the application on population-scale datasets. AVAILABILITY AND IMPLEMENTATION: The source code of PopIns2 is available from https://github.com/kehrlab/PopIns2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Software , Humanos , Análise de Sequência de DNA/métodos , Reprodutibilidade dos Testes , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala/métodos
9.
Nature ; 549(7673): 519-522, 2017 09 28.
Artigo em Inglês | MEDLINE | ID: mdl-28959963

RESUMO

The characterization of mutational processes that generate sequence diversity in the human genome is of paramount importance both to medical genetics and to evolutionary studies. To understand how the age and sex of transmitting parents affect de novo mutations, here we sequence 1,548 Icelanders, their parents, and, for a subset of 225, at least one child, to 35× genome-wide coverage. We find 108,778 de novo mutations, both single nucleotide polymorphisms and indels, and determine the parent of origin of 42,961. The number of de novo mutations from mothers increases by 0.37 per year of age (95% CI 0.32-0.43), a quarter of the 1.51 per year from fathers (95% CI 1.45-1.57). The number of clustered mutations increases faster with the mother's age than with the father's, and the genomic span of maternal de novo mutation clusters is greater than that of paternal ones. The types of de novo mutation from mothers change substantially with age, with a 0.26% (95% CI 0.19-0.33%) decrease in cytosine-phosphate-guanine to thymine-phosphate-guanine (CpG>TpG) de novo mutations and a 0.33% (95% CI 0.28-0.38%) increase in C>G de novo mutations per year, respectively. Remarkably, these age-related changes are not distributed uniformly across the genome. A striking example is a 20 megabase region on chromosome 8p, with a maternal C>G mutation rate that is up to 50-fold greater than the rest of the genome. The age-related accumulation of maternal non-crossover gene conversions also mostly occurs within these regions. Increased sequence diversity and linkage disequilibrium of C>G variants within regions affected by excess maternal mutations indicate that the underlying mutational process has persisted in humans for thousands of years. Moreover, the regional excess of C>G variation in humans is largely shared by chimpanzees, less by gorillas, and is almost absent from orangutans. This demonstrates that sequence diversity in humans results from evolving interactions between age, sex, mutation type, and genomic location.


Assuntos
Envelhecimento/genética , Mutação em Linhagem Germinativa/genética , Idade Materna , Mutagênese , Pais , Idade Paterna , Adolescente , Adulto , Idoso , Animais , Criança , Cromossomos Humanos Par 8/genética , Evolução Molecular , Feminino , Sequência Rica em GC , Genoma Humano/genética , Gorilla gorilla/genética , Humanos , Mutação INDEL , Islândia , Desequilíbrio de Ligação/genética , Masculino , Pessoa de Meia-Idade , Taxa de Mutação , Pan troglodytes/genética , Polimorfismo de Nucleotídeo Único , Pongo/genética , Adulto Jovem
11.
Bioinformatics ; 37(15): 2215-2217, 2021 08 09.
Artigo em Inglês | MEDLINE | ID: mdl-33135043

RESUMO

MOTIVATION: Data analysis is requisite on reliable data. In genetics this includes verifying that the sample is not contaminated with another, a problem ubiquitous in biology. RESULTS: In human, and other diploid species, DNA contamination from the same species can be found by the presence of three haplotypes between polymorphic SNPs. read_haps is a tool that detects sample contamination from short read whole genome sequencing data. AVAILABILITYAND IMPLEMENTATION: github.com/DecodeGenetics/read_haps.


Assuntos
Diploide , Sequenciamento de Nucleotídeos em Larga Escala , Sequência de Bases , Haplótipos , Humanos , Análise de Sequência de DNA , Software , Sequenciamento Completo do Genoma
12.
Arterioscler Thromb Vasc Biol ; 41(10): 2616-2628, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-34407635

RESUMO

Objective: Familial hypercholesterolemia (FH) is traditionally defined as a monogenic disease characterized by severely elevated LDL-C (low-density lipoprotein cholesterol) levels. In practice, FH is commonly a clinical diagnosis without confirmation of a causative mutation. In this study, we sought to characterize and compare monogenic and clinically defined FH in a large sample of Icelanders. Approach and Results: We whole-genome sequenced 49 962 Icelanders and imputed the identified variants into an overall sample of 166 281 chip-genotyped Icelanders. We identified 20 FH mutations in LDLR, APOB, and PCSK9 with combined prevalence of 1 in 836. Monogenic FH was associated with severely elevated LDL-C levels and increased risk of premature coronary disease, aortic valve stenosis, and high burden of coronary atherosclerosis. We used a modified version of the Dutch Lipid Clinic Network criteria to screen for the clinical FH phenotype among living adult participants (N=79 058). Clinical FH was found in 2.2% of participants, of whom only 5.2% had monogenic FH. Mutation-negative clinical FH has a strong polygenic basis. Both individuals with monogenic FH and individuals with mutation-negative clinical FH were markedly undertreated with cholesterol-lowering medications and only a minority attained an LDL-C target of <2.6 mmol/L (<100 mg/dL; 11.0% and 24.9%, respectively) or <1.8 mmol/L (<70 mg/dL; 0.0% and 5.2%, respectively), as recommended for primary prevention by European Society of Cardiology/European Atherosclerosis Society cholesterol guidelines. Conclusions: Clinically defined FH is a relatively common phenotype that is explained by monogenic FH in only a minority of cases. Both monogenic and clinical FH confer high cardiovascular risk but are markedly undertreated.


Assuntos
Apolipoproteína B-100/genética , Doenças Cardiovasculares/genética , Hiperlipoproteinemia Tipo II/genética , Lipídeos/sangue , Mutação , Pró-Proteína Convertase 9/genética , Receptores de LDL/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Biomarcadores/sangue , Doenças Cardiovasculares/diagnóstico , Doenças Cardiovasculares/etnologia , Doenças Cardiovasculares/terapia , Feminino , Estudos de Associação Genética , Predisposição Genética para Doença , Humanos , Inibidores de Hidroximetilglutaril-CoA Redutases/uso terapêutico , Hiperlipoproteinemia Tipo II/diagnóstico , Hiperlipoproteinemia Tipo II/tratamento farmacológico , Hiperlipoproteinemia Tipo II/etnologia , Islândia/epidemiologia , Masculino , Pessoa de Meia-Idade , Fenótipo , Prevalência , Prognóstico , Medição de Risco , Fatores de Risco , Adulto Jovem
13.
Hum Mol Genet ; 28(7): 1199-1211, 2019 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-30476138

RESUMO

Urine dipstick tests are widely used in routine medical care to diagnose kidney and urinary tract and metabolic diseases. Several environmental factors are known to affect the test results, whereas the effects of genetic diversity are largely unknown. We tested 32.5 million sequence variants for association with urinary biomarkers in a set of 150 274 Icelanders with urine dipstick measurements. We detected 20 association signals, of which 14 are novel, associating with at least one of five clinical entities defined by the urine dipstick: glucosuria, ketonuria, proteinuria, hematuria and urine pH. These include three independent glucosuria variants at SLC5A2, the gene encoding the sodium-dependent glucose transporter (SGLT2), a protein targeted pharmacologically to increase urinary glucose excretion in the treatment of diabetes. Two variants associating with proteinuria are in LRP2 and CUBN, encoding the co-transporters megalin and cubilin, respectively, that mediate proximal tubule protein uptake. One of the hematuria-associated variants is a rare, previously unreported 2.5 kb exonic deletion in COL4A3. Of the four signals associated with urine pH, we note that the pH-increasing alleles of two variants (POU2AF1, WDR72) associate significantly with increased risk of kidney stones. Our results reveal that genetic factors affect variability in urinary biomarkers, in both a disease dependent and independent context.


Assuntos
Biomarcadores/análise , Biomarcadores/urina , Variação Genética/genética , Adulto , Idoso , Alelos , Feminino , Hematúria/genética , Hematúria/urina , Humanos , Concentração de Íons de Hidrogênio , Islândia , Cetose/genética , Cetose/urina , Rim/metabolismo , Masculino , Pessoa de Meia-Idade , Proteinúria/genética , Proteinúria/urina , Transportador 2 de Glucose-Sódio/genética , Sequenciamento Completo do Genoma/métodos
14.
Bioinformatics ; 36(7): 2269-2271, 2020 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-31804671

RESUMO

SUMMARY: popSTR2 is an update and augmentation of our previous work 'popSTR: a population-based microsatellite genotyper'. To make genotyping sensitive to inter-sample differences, we supply a kernel to estimate sample-specific slippage rates. For clinical sequencing purposes, a panel of known pathogenic repeat expansions is provided along with a script that scans and flags for manual inspection markers indicative of a pathogenic expansion. Like its predecessor, popSTR2 allows for joint genotyping of samples at a population scale. We now provide a binning method that makes the microsatellite genotypes more amenable to analysis within standard association pipelines and can increase association power. AVAILABILITY AND IMPLEMENTATION: https://github.com/DecodeGenetics/popSTR. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Repetições de Microssatélites , Software , Genótipo
15.
BMC Med Inform Decis Mak ; 19(1): 27, 2019 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-30709348

RESUMO

BACKGROUND: Although osteoporosis is an easily diagnosed and treatable condition, many individuals remain untreated. Clinical decision support systems might increase appropriate treatment of osteoporosis. We designed the Osteoporosis Advisor (OPAD), a computerized tool to support physicians managing osteoporosis at the point-of-care. The present study compares the treatment recommendations provided by OPAD, an expert physician and the National Osteoporosis Guideline Group (NOGG). METHODS: We performed a retrospective analysis of 259 patients attending the outpatient osteoporosis clinic at the University Hospital in Iceland. We entered each patient's data into the OPAD and recorded the OPAD diagnostic comments, 10-year risk of major osteoporotic fracture and treatment options. We compared OPAD recommendations to those given by the osteoporosis specialist, and to those of the NOGG. RESULTS: Risk estimates made by OPAD were highly correlated with those from FRAX (r = 0.99, 95% CI 0.99, 1.00 without femoral neck BMD; r = 0.98, 95% CI, 0.97, 0.99 with femoral neck BMD. Reassurance was recommended by the expert, NOGG and the OPAD in 68, 63 and 52% of cases, respectively. Likewise, intervention was recommended by the expert, NOGG, and the OPAD in 32, 37 and 48% of cases, respectively. The OPAD demonstrated moderate agreement with the physician (kappa 0.51, 95% CI 0.41, 0.61) and even higher agreement with NOGG (kappa 0.69, 95% CI 0.60, 0.77). CONCLUSION: Primary care physicians can use the OPAD to assess and treat patients' skeletal health. Recommendations given by OPAD are consistent with expert opinion and existing guidelines.


Assuntos
Sistemas de Apoio a Decisões Clínicas/normas , Osteologia/métodos , Osteoporose/diagnóstico , Osteoporose/terapia , Guias de Prática Clínica como Assunto/normas , Medição de Risco/normas , Idoso , Feminino , Humanos , Pessoa de Meia-Idade , Médicos de Atenção Primária , Projetos Piloto , Sistemas Automatizados de Assistência Junto ao Leito , Estudos Retrospectivos
16.
J Cell Mol Med ; 22(3): 1574-1582, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-29266682

RESUMO

To find sequence variants affecting prostate cancer (PCA) susceptibility in an unscreened Romanian population we use a genome-wide association study (GWAS). The study population included 990 unrelated pathologically confirmed PCA cases and 1034 male controls. DNA was genotyped using Illumina SNP arrays, and 24.295.558 variants were imputed using the 1000 Genomes data set. An association test was performed between the imputed markers and PCA. A systematic literature review for variants associated with PCA risk identified 115 unique variants that were tested in the Romanian sample set. Thirty of the previously reported SNPs replicated (P-value < 0.05), with the strongest associations observed at: 8q24.21, 11q13.3, 6q25.3, 5p15.33, 22q13.2, 17q12 and 3q13.2. The replicated variants showing the most significant association in Romania are rs1016343 at 8q24.21 (P = 2.2 × 10-4 ), rs7929962 at 11q13.3 (P = 2.7 × 10-4 ) and rs9364554 at 6q25.2 (P = 4.7 × 10-4 ). None of the variants tested in the Romanian GWAS reached genome-wide significance (P-value <5 × 10-8 ) but 807 markers had P-values <1 × 10-4 . Here, we report the results of the first GWAS of PCA performed in a Romanian population. Our study provides evidence that a substantial fraction of previously validated PCA variants associate with risk in this unscreened Romanian population.


Assuntos
Biomarcadores Tumorais/genética , Loci Gênicos , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único , Antígeno Prostático Específico/genética , Neoplasias da Próstata/diagnóstico , Idoso , Idoso de 80 Anos ou mais , Alelos , Biomarcadores Tumorais/sangue , Estudos de Casos e Controles , Perfilação da Expressão Gênica , Frequência do Gene , Genoma Humano , Estudo de Associação Genômica Ampla , Humanos , Masculino , Pessoa de Meia-Idade , Estadiamento de Neoplasias , Análise de Sequência com Séries de Oligonucleotídeos , Antígeno Prostático Específico/sangue , Neoplasias da Próstata/sangue , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologia , Risco , Romênia
17.
J Cell Mol Med ; 22(12): 6068-6076, 2018 12.
Artigo em Inglês | MEDLINE | ID: mdl-30324682

RESUMO

Two familial forms of colorectal cancer (CRC), Lynch syndrome (LS) and familial adenomatous polyposis (FAP), are caused by rare mutations in DNA mismatch repair genes (MLH1, MSH2, MSH6, PMS2) and the genes APC and MUTYH, respectively. No information is available on the presence of high-risk CRC mutations in the Romanian population. We performed whole-genome sequencing of 61 Romanian CRC cases with a family history of cancer and/or early onset of disease, focusing the analysis on candidate variants in the LS and FAP genes. The frequencies of all candidate variants were assessed in a cohort of 688 CRC cases and 4567 controls. Immunohistochemical (IHC) staining for MLH1, MSH2, MSH6, and PMS2 was performed on tumour tissue. We identified 11 candidate variants in 11 cases; six variants in MLH1, one in MSH6, one in PMS2, and three in APC. Combining information on the predicted impact of the variants on the proteins, IHC results and previous reports, we found three novel pathogenic variants (MLH1:p.Lys84ThrfsTer4, MLH1:p.Ala586CysfsTer7, PMS2:p.Arg211ThrfsTer38), and two novel variants that are unlikely to be pathogenic. Also, we confirmed three previously published pathogenic LS variants and suggest to reclassify a previously reported variant of uncertain significance to pathogenic (MLH1:c.1559-1G>C).


Assuntos
Polipose Adenomatosa do Colo/genética , Neoplasias Colorretais Hereditárias sem Polipose/genética , Reparo de Erro de Pareamento de DNA/genética , Predisposição Genética para Doença , Polipose Adenomatosa do Colo/epidemiologia , Polipose Adenomatosa do Colo/patologia , Adulto , Idoso , Neoplasias Colorretais Hereditárias sem Polipose/epidemiologia , Neoplasias Colorretais Hereditárias sem Polipose/patologia , DNA Glicosilases/genética , Metilação de DNA/genética , Proteínas de Ligação a DNA/genética , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Endonuclease PMS2 de Reparo de Erro de Pareamento/genética , Proteína 1 Homóloga a MutL/genética , Proteína 2 Homóloga a MutS/genética , Mutação , Fatores de Risco , Romênia/epidemiologia
18.
Hum Mol Genet ; 25(5): 1008-18, 2016 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-26740556

RESUMO

Transcriptional and splicing anomalies have been observed in intron 8 of the CASP8 gene (encoding procaspase-8) in association with cutaneous basal-cell carcinoma (BCC) and linked to a germline SNP rs700635. Here, we show that the rs700635[C] allele, which is associated with increased risk of BCC and breast cancer, is protective against prostate cancer [odds ratio (OR) = 0.91, P = 1.0 × 10(-6)]. rs700635[C] is also associated with failures to correctly splice out CASP8 intron 8 in breast and prostate tumours and in corresponding normal tissues. Investigation of rs700635[C] carriers revealed that they have a human-specific short interspersed element-variable number of tandem repeat-Alu (SINE-VNTR-Alu), subfamily-E retrotransposon (SVA-E) inserted into CASP8 intron 8. The SVA-E shows evidence of prior activity, because it has transduced some CASP8 sequences during subsequent retrotransposition events. Whole-genome sequence (WGS) data were used to tag the SVA-E with a surrogate SNP rs1035142[T] (r(2) = 0.999), which showed associations with both the splicing anomalies (P = 6.5 × 10(-32)) and with protection against prostate cancer (OR = 0.91, P = 3.8 × 10(-7)).


Assuntos
Neoplasias da Mama/genética , Carcinoma Basocelular/genética , Caspase 8/genética , Neoplasias da Próstata/genética , Splicing de RNA , Retroelementos , Neoplasias Cutâneas/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Alelos , Sequência de Bases , Neoplasias da Mama/metabolismo , Neoplasias da Mama/patologia , Carcinoma Basocelular/metabolismo , Carcinoma Basocelular/patologia , Caspase 8/metabolismo , Feminino , Estudo de Associação Genômica Ampla , Humanos , Íntrons , Masculino , Pessoa de Meia-Idade , Dados de Sequência Molecular , Razão de Chances , Polimorfismo de Nucleotídeo Único , Neoplasias da Próstata/metabolismo , Neoplasias da Próstata/patologia , Neoplasias da Próstata/prevenção & controle , Fatores de Proteção , Neoplasias Cutâneas/metabolismo , Neoplasias Cutâneas/patologia
19.
Bioinformatics ; 33(24): 4041-4048, 2017 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-27591079

RESUMO

MOTIVATION: Microsatellites, also known as short tandem repeats (STRs), are tracts of repetitive DNA sequences containing motifs ranging from two to six bases. Microsatellites are one of the most abundant type of variation in the human genome, after single nucleotide polymorphisms (SNPs) and Indels. Microsatellite analysis has a wide range of applications, including medical genetics, forensics and construction of genetic genealogy. However, microsatellite variations are rarely considered in whole-genome sequencing studies, in large due to a lack of tools capable of analyzing them. RESULTS: Here we present a microsatellite genotyper, optimized for Illumina WGS data, which is both faster and more accurate than other methods previously presented. There are two main ingredients to our improvements. First we reduce the amount of sequencing data necessary for creating microsatellite profiles by using previously aligned sequencing data. Second, we use population information to train microsatellite and individual specific error profiles. By comparing our genotyping results to genotypes generated by capillary electrophoresis we show that our error rates are 50% lower than those of lobSTR, another program specifically developed to determine microsatellite genotypes. AVAILABILITY AND IMPLEMENTATION: Source code is available on Github: https://github.com/DecodeGenetics/popSTR. CONTACT: snaedis.kristmundsdottir@decode.is or bjarni.halldorsson@decode.is.


Assuntos
Repetições de Microssatélites , Genótipo , Humanos , Software , Sequenciamento Completo do Genoma
20.
Bioinformatics ; 32(7): 961-7, 2016 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-25926346

RESUMO

MOTIVATION: The detection of genomic structural variation (SV) has advanced tremendously in recent years due to progress in high-throughput sequencing technologies. Novel sequence insertions, insertions without similarity to a human reference genome, have received less attention than other types of SVs due to the computational challenges in their detection from short read sequencing data, which inherently involves de novo assembly. De novo assembly is not only computationally challenging, but also requires high-quality data. Although the reads from a single individual may not always meet this requirement, using reads from multiple individuals can increase power to detect novel insertions. RESULTS: We have developed the program PopIns, which can discover and characterize non-reference insertions of 100 bp or longer on a population scale. In this article, we describe the approach we implemented in PopIns. It takes as input a reads-to-reference alignment, assembles unaligned reads using a standard assembly tool, merges the contigs of different individuals into high-confidence sequences, anchors the merged sequences into the reference genome, and finally genotypes all individuals for the discovered insertions. Our tests on simulated data indicate that the merging step greatly improves the quality and reliability of predicted insertions and that PopIns shows significantly better recall and precision than the recent tool MindTheGap. Preliminary results on a dataset of 305 Icelanders demonstrate the practicality of the new approach. AVAILABILITY AND IMPLEMENTATION: The source code of PopIns is available from http://github.com/bkehr/popins CONTACT: birte.kehr@decode.is SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , Variação Estrutural do Genoma , Humanos , Mutagênese Insercional , Reprodutibilidade dos Testes
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA