Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Genome Biol ; 25(1): 69, 2024 Mar 11.
Artigo em Inglês | MEDLINE | ID: mdl-38468278

RESUMO

BACKGROUND: Long-read sequencing can enable the detection of base modifications, such as CpG methylation, in single molecules of DNA. The most commonly used methods for long-read sequencing are nanopore developed by Oxford Nanopore Technologies (ONT) and single molecule real-time (SMRT) sequencing developed by Pacific Bioscience (PacBio). In this study, we systematically compare the performance of CpG methylation detection from long-read sequencing. RESULTS: We demonstrate that CpG methylation detection from 7179 nanopore-sequenced DNA samples is highly accurate and consistent with 132 oxidative bisulfite-sequenced (oxBS) samples, isolated from the same blood draws. We introduce quality filters for CpGs that further enhance the accuracy of CpG methylation detection from nanopore-sequenced DNA, while removing at most 30% of CpGs. We evaluate the per-site performance of CpG methylation detection across different genomic features and CpG methylation rates and demonstrate how the latest R10.4 flowcell chemistry and base-calling algorithms improve methylation detection from nanopore sequencing. Additionally, we show how the methylation detection of 50 SMRT-sequenced genomes compares to nanopore sequencing and oxBS. CONCLUSIONS: This study provides the first systematic comparison of CpG methylation detection tools for long-read sequencing methods. We compare two commonly used computational methods for the detection of CpG methylation in a large number of nanopore genomes, including samples sequenced using the latest R10.4 nanopore flowcell chemistry and 50 SMRT sequenced samples. We provide insights into the strengths and limitations of each sequencing method as well as recommendations for standardization and evaluation of tools designed for genome-scale modified base detection using long-read sequencing.


Assuntos
Metilação de DNA , Genoma Humano , Humanos , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , DNA
2.
Bioinformatics ; 39(8)2023 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-37535674

RESUMO

MOTIVATION: Meiotic recombination is the main driving force of human genetic diversity, along with mutations. Recombinations split into crossovers, separating large chromosomal regions originating from different homologous chromosomes, and non-crossovers (NCOs), where a small segment from one chromosome is embedded in a region originating from the homologous chromosome. NCOs are much less studied than mutations and crossovers as NCOs are short and can only be detected at markers heterozygous in the transmitting parent, leaving most of them undetectable. RESULTS: The detectable NCOs, known as gene conversions, hide information about NCOs, including their number and length, waiting to be unveiled. We introduce NCOurd, software, and algorithm, based on an expectation-maximization algorithm, to estimate the number of NCOs and their length distribution from gene conversion data. AVAILABILITY AND IMPLEMENTATION: https://github.com/DecodeGenetics/NCOurd.


Assuntos
Troca Genética , Conversão Gênica , Humanos , Heterozigoto , Meiose
3.
Nat Commun ; 14(1): 3855, 2023 06 29.
Artigo em Inglês | MEDLINE | ID: mdl-37386006

RESUMO

Microsatellites are polymorphic tracts of short tandem repeats with one to six base-pair (bp) motifs and are some of the most polymorphic variants in the genome. Using 6084 Icelandic parent-offspring trios we estimate 63.7 (95% CI: 61.9-65.4) microsatellite de novo mutations (mDNMs) per offspring per generation, excluding one bp repeats motifs (homopolymers) the estimate is 48.2 mDNMs (95% CI: 46.7-49.6). Paternal mDNMs occur at longer repeats than maternal ones, which are in turn larger with a mean size of 3.4 bp vs 3.1 bp for paternal ones. mDNMs increase by 0.97 (95% CI: 0.90-1.04) and 0.31 (95% CI: 0.25-0.37) per year of father's and mother's age at conception, respectively. Here, we find two independent coding variants that associate with the number of mDNMs transmitted to offspring; The minor allele of a missense variant (allele frequency (AF) = 1.9%) in MSH2, a mismatch repair gene, increases transmitted mDNMs from both parents (effect: 13.1 paternal and 7.8 maternal mDNMs). A synonymous variant (AF = 20.3%) in NEIL2, a DNA damage repair gene, increases paternally transmitted mDNMs (effect: 4.4 mDNMs). Thus, the microsatellite mutation rate in humans is in part under genetic control.


Assuntos
Reparo de Erro de Pareamento de DNA , Mutação em Linhagem Germinativa , Humanos , Alelos , Mutação em Linhagem Germinativa/genética , Repetições de Microssatélites/genética , Células Germinativas
4.
Sci Adv ; 9(23): eabq2969, 2023 06 09.
Artigo em Inglês | MEDLINE | ID: mdl-37294764

RESUMO

The genetic basis of the human vocal system is largely unknown, as are the sequence variants that give rise to individual differences in voice and speech. Here, we couple data on diversity in the sequence of the genome with voice and vowel acoustics in speech recordings from 12,901 Icelanders. We show how voice pitch and vowel acoustics vary across the life span and correlate with anthropometric, physiological, and cognitive traits. We found that voice pitch and vowel acoustics have a heritable component and discovered correlated common variants in ABCC9 that associate with voice pitch. The ABCC9 variants also associate with adrenal gene expression and cardiovascular traits. By showing that voice and vowel acoustics are influenced by genetics, we have taken important steps toward understanding the genetics and evolution of the human vocal system.


Assuntos
Acústica da Fala , Voz , Humanos , Fala/fisiologia , Acústica
5.
Nature ; 607(7920): 732-740, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35859178

RESUMO

Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data1,2. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank3. This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.


Assuntos
Bancos de Espécimes Biológicos , Bases de Dados Genéticas , Variação Genética , Genoma Humano , Genômica , Sequenciamento Completo do Genoma , África/etnologia , Ásia/etnologia , Estudos de Coortes , Sequência Conservada , Éxons/genética , Genoma Humano/genética , Haplótipos/genética , Humanos , Mutação INDEL , Irlanda/etnologia , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único/genética , Reino Unido
6.
Nat Genet ; 53(6): 779-786, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-33972781

RESUMO

Long-read sequencing (LRS) promises to improve the characterization of structural variants (SVs). We generated LRS data from 3,622 Icelanders and identified a median of 22,636 SVs per individual (a median of 13,353 insertions and 9,474 deletions). We discovered a set of 133,886 reliably genotyped SV alleles and imputed them into 166,281 individuals to explore their effects on diseases and other traits. We discovered an association of a rare deletion in PCSK9 with lower low-density lipoprotein (LDL) cholesterol levels, compared to the population average. We also discovered an association of a multiallelic SV in ACAN with height; we found 11 alleles that differed in the number of a 57-bp-motif repeat and observed a linear relationship between the number of repeats carried and height. These results show that SVs can be accurately characterized at the population scale using LRS data in a genome-wide non-targeted approach and demonstrate how SVs impact phenotypes.


Assuntos
Doença/genética , Variação Estrutural do Genoma , Sequenciamento de Nucleotídeos em Larga Escala , Característica Quantitativa Herdável , Alelos , LDL-Colesterol/metabolismo , Cromossomos Humanos/genética , Feminino , Frequência do Gene/genética , Humanos , Islândia , Modelos Lineares , Masculino , Pró-Proteína Convertase 9/genética , Recombinação Genética/genética , Deleção de Sequência/genética
7.
Nat Commun ; 10(1): 5402, 2019 11 27.
Artigo em Inglês | MEDLINE | ID: mdl-31776332

RESUMO

Analysis of sequence diversity in the human genome is fundamental for genetic studies. Structural variants (SVs) are frequently omitted in sequence analysis studies, although each has a relatively large impact on the genome. Here, we present GraphTyper2, which uses pangenome graphs to genotype SVs and small variants using short-reads. Comparison to the syndip benchmark dataset shows that our SV genotyping is sensitive and variant segregation in families demonstrates the accuracy of our approach. We demonstrate that incorporating public assembly data into our pipeline greatly improves sensitivity, particularly for large insertions. We validate 6,812 SVs on average per genome using long-read data of 41 Icelanders. We show that GraphTyper2 can simultaneously genotype tens of thousands of whole-genomes by characterizing 60 million small variants and half a million SVs in 49,962 Icelanders, including 80 thousand SVs with high-confidence.


Assuntos
Genoma Humano , Variação Estrutural do Genoma , Técnicas de Genotipagem/métodos , Software , Gráficos por Computador , Bases de Dados Genéticas , Genética Populacional , Técnicas de Genotipagem/estatística & dados numéricos , Humanos , Islândia , Linhagem , Polimorfismo de Nucleotídeo Único , Reprodutibilidade dos Testes , Fluxo de Trabalho
8.
Science ; 363(6425)2019 Jan 25.
Artigo em Inglês | MEDLINE | ID: mdl-30679340

RESUMO

Genetic diversity arises from recombination and de novo mutation (DNM). Using a combination of microarray genotype and whole-genome sequence data on parent-child pairs, we identified 4,531,535 crossover recombinations and 200,435 DNMs. The resulting genetic map has a resolution of 682 base pairs. Crossovers exhibit a mutagenic effect, with overrepresentation of DNMs within 1 kilobase of crossovers in males and females. In females, a higher mutation rate is observed up to 40 kilobases from crossovers, particularly for complex crossovers, which increase with maternal age. We identified 35 loci associated with the recombination rate or the location of crossovers, demonstrating extensive genetic control of meiotic recombination, and our results highlight genes linked to the formation of the synaptonemal complex as determinants of crossovers.


Assuntos
Troca Genética , Análise Mutacional de DNA , Taxa de Mutação , Mapeamento Cromossômico , Epigênese Genética , Feminino , Estudo de Associação Genômica Ampla , Técnicas de Genotipagem , Humanos , Islândia , Masculino , Idade Materna , Análise de Sequência com Séries de Oligonucleotídeos , Polimorfismo de Nucleotídeo Único , Complexo Sinaptonêmico
9.
Nat Genet ; 50(11): 1616, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30237445

RESUMO

In the version of this article published, statements about the impact of insertions and deletions on gene conversions were incorrect. We reported a bias toward deletions, whereas in fact the bias was toward insertions. We are deeply indebted to Laurent Duret and Brice Letcher for noticing this mistake in our manuscript. The following statements are incorrect in the published manuscript.

10.
Sci Data ; 4: 170115, 2017 09 21.
Artigo em Inglês | MEDLINE | ID: mdl-28933420

RESUMO

Understanding of sequence diversity is the cornerstone of analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders who we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing GATK filters: 31,079,378 SNPs and 7,940,790 indels. Calling de novo mutations (DNMs) is a formidable challenge given the high false positive rate in sequencing datasets relative to the mutation rate. Here we addressed this issue by using segregation of alleles in three-generation families. Using this transmission assay, we controlled the false positive rate and identified 108,778 high quality DNMs. Furthermore, we used our extended family structure and read pair tracing of DNMs to a panel of phased SNPs, to determine the parent of origin of 42,961 DNMs.


Assuntos
Genoma Humano , Humanos , Mutação INDEL , Islândia , Polimorfismo de Nucleotídeo Único
11.
Nature ; 549(7673): 519-522, 2017 09 28.
Artigo em Inglês | MEDLINE | ID: mdl-28959963

RESUMO

The characterization of mutational processes that generate sequence diversity in the human genome is of paramount importance both to medical genetics and to evolutionary studies. To understand how the age and sex of transmitting parents affect de novo mutations, here we sequence 1,548 Icelanders, their parents, and, for a subset of 225, at least one child, to 35× genome-wide coverage. We find 108,778 de novo mutations, both single nucleotide polymorphisms and indels, and determine the parent of origin of 42,961. The number of de novo mutations from mothers increases by 0.37 per year of age (95% CI 0.32-0.43), a quarter of the 1.51 per year from fathers (95% CI 1.45-1.57). The number of clustered mutations increases faster with the mother's age than with the father's, and the genomic span of maternal de novo mutation clusters is greater than that of paternal ones. The types of de novo mutation from mothers change substantially with age, with a 0.26% (95% CI 0.19-0.33%) decrease in cytosine-phosphate-guanine to thymine-phosphate-guanine (CpG>TpG) de novo mutations and a 0.33% (95% CI 0.28-0.38%) increase in C>G de novo mutations per year, respectively. Remarkably, these age-related changes are not distributed uniformly across the genome. A striking example is a 20 megabase region on chromosome 8p, with a maternal C>G mutation rate that is up to 50-fold greater than the rest of the genome. The age-related accumulation of maternal non-crossover gene conversions also mostly occurs within these regions. Increased sequence diversity and linkage disequilibrium of C>G variants within regions affected by excess maternal mutations indicate that the underlying mutational process has persisted in humans for thousands of years. Moreover, the regional excess of C>G variation in humans is largely shared by chimpanzees, less by gorillas, and is almost absent from orangutans. This demonstrates that sequence diversity in humans results from evolving interactions between age, sex, mutation type, and genomic location.


Assuntos
Envelhecimento/genética , Mutação em Linhagem Germinativa/genética , Idade Materna , Mutagênese , Pais , Idade Paterna , Adolescente , Adulto , Idoso , Animais , Criança , Cromossomos Humanos Par 8/genética , Evolução Molecular , Feminino , Sequência Rica em GC , Genoma Humano/genética , Gorilla gorilla/genética , Humanos , Mutação INDEL , Islândia , Desequilíbrio de Ligação/genética , Masculino , Pessoa de Meia-Idade , Taxa de Mutação , Pan troglodytes/genética , Polimorfismo de Nucleotídeo Único , Pongo/genética , Adulto Jovem
12.
Nat Genet ; 48(11): 1377-1384, 2016 11.
Artigo em Inglês | MEDLINE | ID: mdl-27643539

RESUMO

Meiotic recombination involves a combination of gene conversion and crossover events that, along with mutations, produce germline genetic diversity. Here we report the discovery of 3,176 SNP and 61 indel gene conversions. Our estimate of the non-crossover (NCO) gene conversion rate (G) is 7.0 for SNPs and 5.8 for indels per megabase per generation, and the GC bias is 67.6%. For indels, we demonstrate a 65.6% preference for the shorter allele. NCO gene conversions from mothers are longer than those from fathers, and G is 2.17 times greater in mothers. Notably, G increases with the age of mothers, but not the age of fathers. A disproportionate number of NCO gene conversions in older mothers occur outside double-strand break (DSB) regions and in regions with relatively low GC content. This points to age-related changes in the mechanisms of meiotic gene conversion in oocytes.


Assuntos
Conversão Gênica , Meiose , Adulto , Composição de Bases , Criança , Feminino , Humanos , Masculino , Idade Materna , Polimorfismo de Nucleotídeo Único , Caracteres Sexuais
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...