Results 1 - 11 of 11
1.
bioRxiv ; 2024 Mar 01.
Article in English | MEDLINE | ID: mdl-38464144

ABSTRACT

DNA methylation most commonly occurs as 5-methylcytosine (5mC) in the human genome and has been associated with human diseases. Recent developments in single-molecule sequencing technologies (Oxford Nanopore Technologies (ONT) and Pacific Biosciences) have enabled readouts of long, native DNA molecules, including cytosine methylation. ONT recently upgraded their Nanopore sequencing chemistry and kits from R9 to the R10 version, which yielded increased accuracy and sequencing throughput. However, the effects on methylation detection have not yet been documented. Here we performed a series of computational analyses to characterize differences in Nanopore-based 5mC detection between the ONT R9 and R10 chemistries. We compared 5mC calls in R9 and R10 for three human genome datasets: a cell line, a frontal cortex brain sample, and a blood sample. We performed an in-depth analysis on CpG islands and homopolymer regions, and documented high concordance for methylation detection among sequencing technologies. The strongest correlation was observed between Nanopore R10 and Illumina bisulfite technologies for cell line-derived datasets. Subtle differences in methylation datasets between technologies can impact analysis tools such as differential methylation calling software. Our findings show that comparisons can be drawn between methylation data from different Nanopore chemistries using guided hypotheses. This work will facilitate comparison among Nanopore data cohorts derived using different chemistries from large-scale sequencing efforts, such as the NIH CARD Long Read Initiative.

2.
Am J Hum Genet ; 111(3): 544-561, 2024 Mar 07.
Article in English | MEDLINE | ID: mdl-38307027

ABSTRACT

Cervical cancer is caused by human papillomavirus (HPV) infection, has few approved targeted therapeutics, and is the most common cause of cancer death in low-resource countries. We characterized 19 cervical and four head and neck cancer cell lines using long-read DNA and RNA sequencing and identified the HPV types, HPV integration sites, chromosomal alterations, and cancer driver mutations. Structural variation analysis revealed telomeric deletions associated with DNA inversions resulting from breakage-fusion-bridge (BFB) cycles. BFB is a common mechanism of chromosomal alterations in cancer, and our study applies long-read sequencing to this important chromosomal rearrangement type. Analysis of the inversion sites revealed staggered ends consistent with exonuclease digestion of the DNA after breakage. Some BFB events are complex, involving inter- or intra-chromosomal insertions or rearrangements. None of the BFB breakpoints had telomere sequences added to resolve the dicentric chromosomes, and only one BFB breakpoint showed chromothripsis. Five cell lines have a chromosomal region 11q BFB event, with YAP1-BIRC3-BIRC2 amplification. Indeed, YAP1 amplification is associated with a 10-year-earlier age of diagnosis of cervical cancer and is three times more common in African American women. This suggests that individuals with cervical cancer and YAP1-BIRC3-BIRC2 amplification, especially those of African ancestry, might benefit from targeted therapy. In summary, we uncovered valuable insights into the mechanisms and consequences of BFB cycles in cervical cancer using long-read sequencing.


Subjects
Papillomavirus Infections , Uterine Cervical Neoplasms , Female , Humans , Uterine Cervical Neoplasms/genetics , Chromosome Aberrations , Telomere/genetics , DNA
3.
Mov Disord ; 38(12): 2249-2257, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37926948

ABSTRACT

BACKGROUND: Parkin RBR E3 ubiquitin-protein ligase (PRKN) mutations are the most common cause of young onset and autosomal recessive Parkinson's disease (PD). PRKN is located in FRA6E, which is one of the common fragile sites in the human genome, making this region prone to structural variants. However, complex structural variants such as inversions of PRKN are seldom reported, suggesting that there are potentially unrevealed complex pathogenic PRKN structural variants. OBJECTIVES: To identify complex structural variants in PRKN using long-read sequencing. METHODS: We investigated the genetic cause of monozygotic twins presenting with a young onset dystonia-parkinsonism using targeted sequencing, whole exome sequencing, multiple ligation probe amplification, and long-read sequencing. We assessed the presence and frequency of complex inversions overlapping PRKN using whole-genome sequencing data of Accelerating Medicines Partnership Parkinson's disease (AMP-PD) and United Kingdom (UK)-Biobank datasets. RESULTS: Multiple ligation probe amplification identified a heterozygous exon three deletion in PRKN and long-read sequencing identified a large novel inversion spanning over 7 Mb, including a large part of the coding DNA sequence of PRKN. We could diagnose the affected subjects as compound heterozygous carriers of PRKN. We analyzed whole genome sequencing data of 43,538 participants of the UK-Biobank and 4941 participants of the AMP-PD datasets. Nine inversions in the UK-Biobank and two in AMP PD were identified and were considered potentially damaging and likely to affect PRKN expression. CONCLUSIONS: This is the first report describing a large 7 Mb inversion involving breakpoints outside of PRKN. This study highlights the importance of using long-read sequencing for structural variant analysis in unresolved young-onset PD cases. © 2023 The Authors. 
Movement Disorders published by Wiley Periodicals LLC on behalf of International Parkinson and Movement Disorder Society. This article has been contributed to by U.S. Government employees and their work is in the public domain in the USA.


Subjects
Parkinson Disease , Parkinsonian Disorders , Humans , Heterozygote , Mutation/genetics , Parkinson Disease/genetics , Parkinsonian Disorders/genetics , Ubiquitin-Protein Ligases/genetics
4.
medRxiv ; 2023 Aug 21.
Article in English | MEDLINE | ID: mdl-37790330

ABSTRACT

Background: PRKN mutations are the most common cause of young onset and autosomal recessive Parkinson's disease (PD). PRKN is located in FRA6E, which is one of the common fragile sites in the human genome, making this region prone to structural variants. However, complex structural variants such as inversions of PRKN are seldom reported, suggesting that there are potentially unrevealed complex pathogenic PRKN structural variants. Objectives: To identify complex structural variants in PRKN using long-read sequencing. Methods: We investigated the genetic cause of monozygotic twins presenting with a young onset dystonia-parkinsonism using targeted sequencing, whole exome sequencing, multiple ligation probe amplification, and long-read sequencing. We assessed the presence and frequency of complex inversions overlapping PRKN using whole-genome sequencing data of AMP-PD and UK-Biobank datasets. Results: Multiple ligation probe amplification identified a heterozygous exon 3 deletion in PRKN and long-read sequencing identified a large novel inversion spanning over 7 Mb, including a large part of the coding DNA sequence of PRKN. We could diagnose the affected subjects as compound heterozygous carriers of PRKN. We analyzed whole genome sequencing data of 43,538 participants of the UK-Biobank and 4,941 participants of the AMP-PD datasets. Nine inversions in the UK-Biobank and two in AMP-PD were identified and were considered potentially damaging and likely to affect PRKN isoforms. Conclusions: This is the first report describing a large 7 Mb inversion involving breakpoints outside of PRKN. This study highlights the importance of using long-read whole genome sequencing for structural variant analysis in unresolved young-onset PD cases.

5.
medRxiv ; 2023 Aug 22.
Article in English | MEDLINE | ID: mdl-37662332

ABSTRACT

Cervical cancer is caused by human papillomavirus (HPV) infection, has few approved targeted therapeutics, and is the most common cause of cancer death in low-resource countries. We characterized 19 cervical and four head and neck cell lines using long-read DNA and RNA sequencing and identified the HPV types, HPV integration sites, chromosomal alterations, and cancer driver mutations. Structural variation analysis revealed telomeric deletions associated with DNA inversions resulting from breakage-fusion-bridge (BFB) cycles. BFB is a common mechanism of chromosomal alterations in cancer, and this is one of the first analyses of these events using long-read sequencing. Analysis of the inversion sites revealed staggered ends consistent with exonuclease digestion of the DNA after breakage. Some BFB events are complex, involving inter- or intra-chromosomal insertions or rearrangements. None of the BFB breakpoints had telomere sequences added to resolve the dicentric chromosomes and only one BFB breakpoint showed chromothripsis. Five cell lines have a Chr11q BFB event, with YAP1/BIRC2/BIRC3 gene amplification. Indeed, YAP1 amplification is associated with a 10-year earlier age of diagnosis of cervical cancer and is three times more common in African American women. This suggests that cervical cancer patients with YAP1/BIRC2/BIRC3-amplification, especially those of African American ancestry, might benefit from targeted therapy. In summary, we uncovered new insights into the mechanisms and consequences of BFB cycles in cervical cancer using long-read sequencing.

6.
Nat Methods ; 20(10): 1483-1492, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37710018

ABSTRACT

Long-read sequencing technologies substantially overcome the limitations of short-reads but have not been considered as a feasible replacement for population-scale projects, being a combination of too expensive, not scalable enough or too error-prone. Here we develop an efficient and scalable wet lab and computational protocol, Napu, for Oxford Nanopore Technologies long-read sequencing that seeks to address those limitations. We applied our protocol to cell lines and brain tissue samples as part of a pilot project for the National Institutes of Health Center for Alzheimer's and Related Dementias. Using a single PromethION flow cell, we can detect single nucleotide polymorphisms with F1-score comparable to Illumina short-read sequencing. Small indel calling remains difficult within homopolymers and tandem repeats, but achieves good concordance to Illumina indel calls elsewhere. Further, we can discover structural variants with F1-score on par with state-of-the-art de novo assembly methods. Our protocol phases small and structural variants at megabase scales and produces highly accurate, haplotype-specific methylation calls.


Subjects
Human Genome , Nanopore Sequencing , Humans , DNA Sequence Analysis/methods , Haplotypes , Methylation , Pilot Projects , High-Throughput Nucleotide Sequencing/methods
7.
Lancet Neurol ; 22(11): 1015-1025, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37633302

ABSTRACT

BACKGROUND: An understanding of the genetic mechanisms underlying diseases in ancestrally diverse populations is an important step towards development of targeted treatments. Research in African and African admixed populations can enable mapping of complex traits, because of their genetic diversity, extensive population substructure, and distinct linkage disequilibrium patterns. We aimed to do a comprehensive genome-wide assessment in African and African admixed individuals to better understand the genetic architecture of Parkinson's disease in these underserved populations. METHODS: We performed a genome-wide association study (GWAS) in people of African and African admixed ancestry with and without Parkinson's disease. Individuals were included from several cohorts that were available as a part of the Global Parkinson's Genetics Program, the International Parkinson's Disease Genomics Consortium Africa, and 23andMe. A diagnosis of Parkinson's disease was confirmed clinically by a movement disorder specialist for every individual in each cohort, except for 23andMe, in which it was self-reported based on clinical diagnosis. We characterised ancestry-specific risk, differential haplotype structure and admixture, coding and structural genetic variation, and enzymatic activity. FINDINGS: We included 197 918 individuals (1488 cases and 196 430 controls) in our genome-wide analysis. We identified a novel common risk factor for Parkinson's disease (overall meta-analysis odds ratio for risk of Parkinson's disease 1·58 [95% CI 1·37-1·80], p=2·397 × 10⁻¹⁴) and age at onset at the GBA1 locus, rs3115534-G (age at onset β=-2·00 [SE=0·57], p=0·0005, for African ancestry; and β=-4·15 [0·58], p=0·015, for African admixed ancestry), which was rare in non-African or non-African admixed populations. Downstream short-read and long-read whole-genome sequencing analyses did not reveal any coding or structural variant underlying the GWAS signal. 
The identified signal seems to be associated with decreased glucocerebrosidase activity. INTERPRETATION: Our study identified a novel genetic risk factor in GBA1 in people of African ancestry, which has not been seen in European populations, and it could be a major mechanistic basis of Parkinson's disease in African populations. This population-specific variant exerts substantial risk on Parkinson's disease as compared with common variation identified through GWAS and it was found to be present in 39% of the cases assessed in this study. This finding highlights the importance of understanding ancestry-specific genetic risk in complex diseases, a particularly crucial point as the Parkinson's disease field moves towards targeted treatments in clinical trials. The distinctive genetics of African populations highlights the need for equitable inclusion of ancestrally diverse groups in future trials, which will be a valuable step towards gaining insights into novel genetic determinants underlying the causes of Parkinson's disease. This finding opens new avenues towards RNA-based and other therapeutic strategies aimed at reducing lifetime risk of Parkinson's disease. FUNDING: The Global Parkinson's Genetics Program, which is funded by the Aligning Science Across Parkinson's initiative, and The Michael J Fox Foundation for Parkinson's Research.


Subjects
African People , Parkinson Disease , Humans , Black People/genetics , Genetic Loci , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study , Linkage Disequilibrium , Parkinson Disease/ethnology , Parkinson Disease/genetics , Single Nucleotide Polymorphism/genetics , African People/genetics
8.
medRxiv ; 2023 May 07.
Article in English | MEDLINE | ID: mdl-37398408

ABSTRACT

Background: Understanding the genetic mechanisms underlying diseases in ancestrally diverse populations is a critical step towards the realization of the global application of precision medicine. The African and African admixed populations enable mapping of complex traits given their greater levels of genetic diversity, extensive population substructure, and distinct linkage disequilibrium patterns. Methods: Here we perform a comprehensive genome-wide assessment of Parkinson's disease (PD) in 197,918 individuals (1,488 cases; 196,430 controls) of African and African admixed ancestry, characterizing population-specific risk, differential haplotype structure and admixture, coding and structural genetic variation and polygenic risk profiling. Findings: We identified a novel common risk factor for PD and age at onset at the GBA1 locus (risk, rs3115534-G; OR=1.58, 95% CI = 1.37 - 1.80, P=2.397E-14; age at onset, BETA =-2.004, SE =0.57, P = 0.0005), that was found to be rare in non-African/African admixed populations. Downstream short- and long-read whole genome sequencing analyses did not reveal any coding or structural variant underlying the GWAS signal. However, we identified that this signal mediates PD risk via expression quantitative trait locus (eQTL) mechanisms. While previously identified GBA1 associated disease risk variants are coding mutations, here we suggest a novel functional mechanism consistent with a trend in decreasing glucocerebrosidase activity levels. Given the high population frequency of the underlying signal and the phenotypic characteristics of the homozygous carriers, we hypothesize that this variant may not cause Gaucher disease. Additionally, the prevalence of Gaucher's disease in Africa is low. Interpretation: The present study identifies a novel African-ancestry genetic risk factor in GBA1 as a major mechanistic basis of PD in the African and African admixed populations. 
This striking result contrasts with previous work in Northern European populations, both in terms of mechanism and attributable risk. This finding highlights the importance of understanding population-specific genetic risk in complex diseases, a particularly crucial point as the field moves toward precision medicine in PD clinical trials and while recognizing the need for equitable inclusion of ancestrally diverse groups in such trials. Given the distinctive genetics of these underrepresented populations, their inclusion represents a valuable step towards insights into novel genetic determinants underlying PD etiology. This opens new avenues towards RNA-based and other therapeutic strategies aimed at reducing lifetime risk. Evidence Before this Study: Our current understanding of Parkinson's disease (PD) is disproportionately based on studying populations of European ancestry, leading to a significant gap in our knowledge about the genetics, clinical characteristics, and pathophysiology in underrepresented populations. This is particularly notable in individuals of African and African admixed ancestries. Over the last two decades, we have witnessed a revolution in the research area of complex genetic diseases. In the PD field, large-scale genome-wide association studies in the European, Asian, and Latin American populations have identified multiple risk loci associated with disease. These include 78 loci and 90 independent signals associated with PD risk in the European population, nine replicated loci and two novel population-specific signals in the Asian population, and a total of 11 novel loci recently nominated through multi-ancestry GWAS efforts. Nevertheless, the African and African admixed populations remain completely unexplored in the context of PD genetics. 
Added Value of this Study: To address the lack of diversity in our research field, this study aimed to conduct the first genome-wide assessment of PD genetics in the African and African admixed populations. Here, we identified a genetic risk factor linked to PD etiology, dissected African-specific differences in risk and age at onset, characterized known genetic risk factors, and highlighted the utility of the African and African admixed risk haplotype substructure for future fine-mapping efforts. We identified a novel disease mechanism via expression changes consistent with decreased GBA1 activity levels. Future large scale single cell expression studies should investigate the neuronal populations in which expression differences are most prominent. This novel mechanism may hold promise for future efficient RNA-based therapeutic strategies such as antisense oligonucleotides or short interfering RNAs aimed at preventing and decreasing disease risk. We envisage that these data generated under the umbrella of the Global Parkinson's Genetics Program (GP2) will shed light on the molecular mechanisms involved in the disease process and might pave the way for future clinical trials and therapeutic interventions. This work represents a valuable resource in an underserved population, supporting pioneering research within GP2 and beyond. Deciphering causal and genetic risk factors in all these ancestries will help determine whether interventions, potential targets for disease modifying treatment, and prevention strategies that are being studied in the European populations are relevant to the African and African admixed populations. Implications of all the Available Evidence: We nominate a novel signal impacting GBA1 as the major genetic risk factor for PD in the African and African admixed populations. The present study could inform future GBA1 clinical trials, improving patient stratification. 
In this regard, genetic testing can help to design trials likely to provide meaningful and actionable answers. It is our hope that these findings may ultimately have clinical utility for this underrepresented population.

9.
Cell Genom ; 3(6): 100316, 2023 Jun 14.
Article in English | MEDLINE | ID: mdl-37388914

ABSTRACT

We characterized the role of structural variants, a largely unexplored type of genetic variation, in two non-Alzheimer's dementias, namely Lewy body dementia (LBD) and frontotemporal dementia (FTD)/amyotrophic lateral sclerosis (ALS). To do this, we applied an advanced structural variant calling pipeline (GATK-SV) to short-read whole-genome sequence data from 5,213 European-ancestry cases and 4,132 controls. We discovered, replicated, and validated a deletion in TPCN1 as a novel risk locus for LBD and detected the known structural variants at the C9orf72 and MAPT loci as associated with FTD/ALS. We also identified rare pathogenic structural variants in both LBD and FTD/ALS. Finally, we assembled a catalog of structural variants that can be mined for new insights into the pathogenesis of these understudied forms of dementia.

10.
bioRxiv ; 2023 Apr 05.
Article in English | MEDLINE | ID: mdl-36711673

ABSTRACT

Long-read sequencing technologies substantially overcome the limitations of short reads but to date have not been considered a feasible replacement at scale, being some combination of too expensive, not scalable enough, or too error-prone. Here, we develop an efficient and scalable wet lab and computational protocol for Oxford Nanopore Technologies (ONT) long-read sequencing that seeks to provide a genuine alternative to short reads for large-scale genomics projects. We applied our protocol to cell lines and brain tissue samples as part of a pilot project for the NIH Center for Alzheimer's and Related Dementias (CARD). Using a single PromethION flow cell, we can detect SNPs with F1-score better than Illumina short-read sequencing. Small indel calling remains difficult inside homopolymers and tandem repeats, but is comparable to Illumina calls elsewhere. Further, we can discover structural variants with F1-score comparable to state-of-the-art methods involving Pacific Biosciences HiFi sequencing and trio information (but at a lower cost and greater throughput). Using ONT-based phasing, we can then combine and phase small and structural variants at megabase scales. Our protocol also produces highly accurate, haplotype-specific methylation calls. Overall, this makes large-scale long-read sequencing projects feasible; the protocol is currently being used to sequence thousands of brain-based genomes as a part of the NIH CARD initiative. We provide the protocol and software as open-source integrated pipelines for generating phased variant calls and assemblies.
