Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 186
Filtrar
1.
Nat Genet ; 56(5): 778-791, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38689001

RESUMO

Hypertension affects more than one billion people worldwide. Here we identify 113 novel loci, reporting a total of 2,103 independent genetic signals (P < 5 × 10-8) from the largest single-stage blood pressure (BP) genome-wide association study to date (n = 1,028,980 European individuals). These associations explain more than 60% of single nucleotide polymorphism-based BP heritability. Comparing top versus bottom deciles of polygenic risk scores (PRSs) reveals clinically meaningful differences in BP (16.9 mmHg systolic BP, 95% CI, 15.5-18.2 mmHg, P = 2.22 × 10-126) and more than a sevenfold higher odds of hypertension risk (odds ratio, 7.33; 95% CI, 5.54-9.70; P = 4.13 × 10-44) in an independent dataset. Adding PRS into hypertension-prediction models increased the area under the receiver operating characteristic curve (AUROC) from 0.791 (95% CI, 0.781-0.801) to 0.826 (95% CI, 0.817-0.836, ∆AUROC, 0.035, P = 1.98 × 10-34). We compare the 2,103 loci results in non-European ancestries and show significant PRS associations in a large African-American sample. Secondary analyses implicate 500 genes previously unreported for BP. Our study highlights the role of increasingly large genomic studies for precision health research.


Assuntos
Pressão Sanguínea , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Hipertensão , Herança Multifatorial , Polimorfismo de Nucleotídeo Único , Feminino , Humanos , Masculino , Pressão Sanguínea/genética , Estratificação de Risco Genético , Hipertensão/genética , Fatores de Risco
2.
iScience ; 27(2): 109001, 2024 Feb 16.
Artigo em Inglês | MEDLINE | ID: mdl-38352226

RESUMO

Early detection is key for increased survival in ovarian cancer, but no general screening program exists today due to lack of biomarkers and overall cost versus benefit over traditional clinical methods. Here, we used dried cervico-vaginal fluid (CVF) as sampling matrix coupled with mass spectrometry for detection of protein biomarkers. We find that self-collected CVF on paper cards yields robust results and is suitable for high-throughput proteomics. Artificial intelligence-based methods were used to identify an 11-protein panel that separates cases from controls. In validation data, the panel achieved a sensitivity of 0.97 (95% CI 0.91-1.00) at a specificity of 0.67 (0.40-0.87). Analyses of samples collected prior to development of symptoms indicate that the panel is informative also of future risk of disease. Dried CVF is used in cervical cancer screening, and our results opens the possibility for a screening program also for ovarian cancer, based on self-collected CVF samples.

3.
Plant J ; 117(3): 944-955, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-37947292

RESUMO

Scots pine (Pinus sylvestris L.) is one of the most widespread and economically important conifer species in the world. Applications like genomic selection and association studies, which could help accelerate breeding cycles, are challenging in Scots pine because of its large and repetitive genome. For this reason, genotyping tools for conifer species, and in particular for Scots pine, are commonly based on transcribed regions of the genome. In this article, we present the Axiom Psyl50K array, the first single nucleotide polymorphism (SNP) genotyping array for Scots pine based on whole-genome resequencing, that represents both genic and intergenic regions. This array was designed following a two-step procedure: first, 192 trees were sequenced, and a 430K SNP screening array was constructed. Then, 480 samples, including haploid megagametophytes, full-sib family trios, breeding population, and range-wide individuals from across Eurasia were genotyped with the screening array. The best 50K SNPs were selected based on quality, replicability, distribution across the draft genome assembly, balance between genic and intergenic regions, and genotype-environment and genotype-phenotype associations. Of the final 49 877 probes tiled in the array, 20 372 (40.84%) occur inside gene models, while the rest lie in intergenic regions. We also show that the Psyl50K array can yield enough high-confidence SNPs for genetic studies in pine species from North America and Eurasia. This new genotyping tool will be a valuable resource for high-throughput fundamental and applied research of Scots pine and other pine species.


Assuntos
Pinus sylvestris , Pinus , Humanos , Pinus sylvestris/genética , Polimorfismo de Nucleotídeo Único/genética , Genótipo , Melhoramento Vegetal , Pinus/genética , DNA Intergênico
4.
Genes (Basel) ; 14(11)2023 Oct 25.
Artigo em Inglês | MEDLINE | ID: mdl-38002932

RESUMO

The development of complete mitochondrial genome (mitogenome) reference data for inclusion in publicly available population databases is currently underway, and the generation of more high-quality mitogenomes will only enhance the statistical power of this forensically useful locus. To characterize mitogenome variation in Sweden, the mitochondrial DNA (mtDNA) reads from the SweGen whole genome sequencing (WGS) dataset were analyzed. To overcome the interference from low-frequency nuclear mtDNA segments (NUMTs), a 10% variant frequency threshold was applied for the analysis. In total, 934 forensic-quality mitogenome haplotypes were characterized. Almost 45% of the SweGen haplotypes belonged to haplogroup H. Nearly all mitogenome haplotypes (99.1%) were assigned to European haplogroups, which was expected based on previous mtDNA studies of the Swedish population. There were signature northern Swedish and Finnish haplogroups observed in the dataset (e.g., U5b1, W1a), consistent with the nuclear DNA analyses of the SweGen data. The complete mitogenome analysis resulted in high haplotype diversity (0.9996) with a random match probability of 0.15%. Overall, the SweGen mitogenomes provide a large mtDNA reference dataset for the Swedish population and also contribute to the effort to estimate global mitogenome haplotype frequencies.


Assuntos
DNA Mitocondrial , Genoma Mitocondrial , Suécia , Análise de Sequência de DNA , DNA Mitocondrial/genética , Genoma Mitocondrial/genética , Mitocôndrias/genética
6.
Nat Immunol ; 24(9): 1540-1551, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-37563310

RESUMO

Circulating proteins have important functions in inflammation and a broad range of diseases. To identify genetic influences on inflammation-related proteins, we conducted a genome-wide protein quantitative trait locus (pQTL) study of 91 plasma proteins measured using the Olink Target platform in 14,824 participants. We identified 180 pQTLs (59 cis, 121 trans). Integration of pQTL data with eQTL and disease genome-wide association studies provided insight into pathogenesis, implicating lymphotoxin-α in multiple sclerosis. Using Mendelian randomization (MR) to assess causality in disease etiology, we identified both shared and distinct effects of specific proteins across immune-mediated diseases, including directionally discordant effects of CD40 on risk of rheumatoid arthritis versus multiple sclerosis and inflammatory bowel disease. MR implicated CXCL5 in the etiology of ulcerative colitis (UC) and we show elevated gut CXCL5 transcript expression in patients with UC. These results identify targets of existing drugs and provide a powerful resource to facilitate future drug target prioritization.


Assuntos
Colite Ulcerativa , Doenças Inflamatórias Intestinais , Esclerose Múltipla , Humanos , Estudo de Associação Genômica Ampla , Doenças Inflamatórias Intestinais/genética , Locos de Características Quantitativas , Colite Ulcerativa/tratamento farmacológico , Colite Ulcerativa/genética , Inflamação/genética , Esclerose Múltipla/genética , Polimorfismo de Nucleotídeo Único
7.
Nat Commun ; 14(1): 4308, 2023 07 18.
Artigo em Inglês | MEDLINE | ID: mdl-37463882

RESUMO

A comprehensive characterization of blood proteome profiles in cancer patients can contribute to a better understanding of the disease etiology, resulting in earlier diagnosis, risk stratification and better monitoring of the different cancer subtypes. Here, we describe the use of next generation protein profiling to explore the proteome signature in blood across patients representing many of the major cancer types. Plasma profiles of 1463 proteins from more than 1400 cancer patients are measured in minute amounts of blood collected at the time of diagnosis and before treatment. An open access Disease Blood Atlas resource allows the exploration of the individual protein profiles in blood collected from the individual cancer patients. We also present studies in which classification models based on machine learning have been used for the identification of a set of proteins associated with each of the analyzed cancers. The implication for cancer precision medicine of next generation plasma profiling is discussed.


Assuntos
Neoplasias Hematológicas , Neoplasias , Humanos , Proteoma/metabolismo , Neoplasias/diagnóstico , Neoplasias/metabolismo , Medicina de Precisão , Aprendizado de Máquina
8.
Res Sq ; 2023 Mar 31.
Artigo em Inglês | MEDLINE | ID: mdl-37034613

RESUMO

Understanding the genetic basis of neuro-related proteins is essential for dissecting the molecular basis of human behavioral traits and the disease etiology of neuropsychiatric disorders. Here, the SCALLOP Consortium conducted a genome-wide association meta-analysis of over 12,500 individuals for 184 neuro-related proteins in human plasma. The analysis identified 117 cis-regulatory protein quantitative trait loci (cis-pQTL) and 166 trans-pQTL. The mapped pQTL capture on average 50% of each protein's heritability. Mendelian randomization analyses revealed multiple proteins showing potential causal effects on neuro-related traits such as sleeping, smoking, feelings, alcohol intake, mental health, and psychiatric disorders. Integrating with established drug information, we validated 13 out of 13 matched combinations of protein targets and diseases or side effects with available drugs, while suggesting hundreds of re-purposing and new therapeutic targets. This consortium effort provides a large-scale proteogenomic resource for biomedical research on human behaviors and other neuro-related phenotypes.

9.
Forensic Sci Int Genet ; 65: 102871, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37054667

RESUMO

The prediction of human characteristics from blood using molecular markers would be very helpful in forensic science. Such information can be particularly important in providing investigative leads in police casework from, for example, blood found at crime scenes in cases without a suspect. Here, we investigated the possibilities and limitations of predicting seven phenotypic traits (sex, age, height, body mass index [BMI], hip-to-waist [WTH] ratio, smoking status and lipid-lowering drug use) using either DNA methylation or plasma proteins separately or in combination. We developed a prediction pipeline starting with the prediction of sex followed by sex-specific, stepwise, individual age, sex-specific anthropometric traits and, finally, lifestyle-related traits. Our data revealed that age, sex and smoking status can be accurately predicted from DNA methylation alone, while the use of plasma proteins was highly accurate for prediction of the WTH ratio, and a combined analysis of the best predictions for BMI and lipid-lowering drug use. In unseen individuals, age was predicted with a standard error of 3.3 years for women and 6.5 years for men, while the accuracy in smoking prediction across both men and women was 0.86. In conclusion, we have developed a stepwise approach for the de-novo prediction of individual characteristics from plasma proteins and DNA methylation markers. These models are accurate and may provide valuable information and investigative leads in future forensic casework.


Assuntos
Lipídeos , Fumar , Masculino , Humanos , Feminino , Pré-Escolar , Índice de Massa Corporal , Marcadores Genéticos , Proteínas Sanguíneas , Epigênese Genética
10.
medRxiv ; 2023 Feb 14.
Artigo em Inglês | MEDLINE | ID: mdl-36824751

RESUMO

Understanding the genetic basis of neuro-related proteins is essential for dissecting the disease etiology of neuropsychiatric disorders and other complex traits and diseases. Here, the SCALLOP Consortium conducted a genome-wide association meta-analysis of over 12,500 individuals for 184 neuro-reiated proteins in human plasma. The analysis identified 117 cis-regulatory protein quantitative trait loci (cis-pQTL) and 166 trans-pQTL. The mapped pQTL capture on average 50% of each protein's heritability. Mendelian randomization analyses revealed multiple proteins showing potential causal effects on neuro-reiated traits as well as complex diseases such as hypertension, high cholesterol, immune-related disorders, and psychiatric disorders. Integrating with established drug information, we validated 13 combinations of protein targets and diseases or side effects with available drugs, while suggesting hundreds of re-purposing and new therapeutic targets for diseases and comorbidities. This consortium effort provides a large-scale proteogenomic resource for biomedical research.

11.
Mol Psychiatry ; 28(1): 475-482, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36380236

RESUMO

Tandem repeat expansions (TREs) are associated with over 60 monogenic disorders and have recently been implicated in complex disorders such as cancer and autism spectrum disorder. The role of TREs in schizophrenia is now emerging. In this study, we have performed a genome-wide investigation of TREs in schizophrenia. Using genome sequence data from 1154 Swedish schizophrenia cases and 934 ancestry-matched population controls, we have detected genome-wide rare (<0.1% population frequency) TREs that have motifs with a length of 2-20 base pairs. We find that the proportion of individuals carrying rare TREs is significantly higher in the schizophrenia group. There is a significantly higher burden of rare TREs in schizophrenia cases than in controls in genic regions, particularly in postsynaptic genes, in genes overlapping brain expression quantitative trait loci, and in brain-expressed genes that are differentially expressed between schizophrenia cases and controls. We demonstrate that TRE-associated genes are more constrained and primarily impact synaptic and neuronal signaling functions. These results have been replicated in an independent Canadian sample that consisted of 252 schizophrenia cases of European ancestry and 222 ancestry-matched controls. Our results support the involvement of rare TREs in schizophrenia etiology.


Assuntos
Transtorno do Espectro Autista , Esquizofrenia , Humanos , Esquizofrenia/genética , Estudo de Associação Genômica Ampla , Canadá , Frequência do Gene , Predisposição Genética para Doença/genética
12.
Neuro Oncol ; 25(1): 97-107, 2023 01 05.
Artigo em Inglês | MEDLINE | ID: mdl-35738865

RESUMO

BACKGROUND: Malignant gliomas, the most common malignant brain tumors in adults, represent a heterogeneous group of diseases with poor prognosis. Retroviruses can cause permanent genetic alterations that modify genes close to the viral integration site. METHODS: Here we describe the use of a high-throughput pipeline coupled to the commonly used tissue-specific retroviral RCAS-TVA mouse tumor model system. Utilizing next-generation sequencing, we show that retroviral integration sites can be reproducibly detected in malignant stem cell lines generated from RCAS-PDGFB-driven glioma biopsies. RESULTS: A large fraction of common integration sites contained genes that have been dysregulated or misexpressed in glioma. Others overlapped with loci identified in previous glioma-related forward genetic screens, but several novel putative cancer-causing genes were also found. Integrating retroviral tagging and clinical data, Ppfibp1 was highlighted as a frequently tagged novel glioma-causing gene. Retroviral integrations into the locus resulted in Ppfibp1 upregulation, and Ppfibp1-tagged cells generated tumors with shorter latency on orthotopic transplantation. In human gliomas, increased PPFIBP1 expression was significantly linked to poor prognosis and PDGF treatment resistance. CONCLUSIONS: Altogether, the current study has demonstrated a novel approach to tagging glioma genes via forward genetics, validating previous results, and identifying PPFIBP1 as a putative oncogene in gliomagenesis.


Assuntos
Neoplasias Encefálicas , Glioma , Animais , Humanos , Camundongos , Neoplasias Encefálicas/patologia , Estudos de Associação Genética , Glioma/patologia , Oncogenes , Proteínas Proto-Oncogênicas c-sis/genética
13.
Commun Med (Lond) ; 2: 124, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36196264

RESUMO

Background: Ovarian cancer is the eighth most common cancer among women and due to late detection prognosis is poor with an overall 5-year survival of 30-50%. Novel biomarkers are needed to reduce diagnostic surgery and enable detection of early-stage cancer by population screening. We have previously developed a risk score based on an 11-biomarker plasma protein assay to distinguish benign tumors (cysts) from malignant ovarian cancer in women with adnexal ovarian mass. Methods: Protein concentrations of 11 proteins were characterized in plasma from 1120 clinical samples with a custom version of the proximity extension assay. The performance of the assay was evaluated in terms of prediction accuracy based on receiver operating characteristics (ROC) and multiple hypothesis adjusted Fisher's Exact tests on achieved sensitivity and specificity. Results: The assay's performance is validated in two independent clinical cohorts with a sensitivity of 0.83/0.91 and specificity of 0.88/0.92. We also show that the risk score follows the clinical development and is reduced upon treatment, and increased with relapse and cancer progression. Data-driven modeling of the risk score patterns during a 2-year follow-up after diagnosis identifies four separate risk score trajectories linked to clinical development and survival. A Cox proportional hazard regression analysis of 5-year survival shows that at time of diagnosis the risk score is the second-strongest predictive variable for survival after tumor stage, whereas MUCIN-16 (CA-125) alone is not significantly predictive. Conclusion: The robust performance of the biomarker assay across clinical cohorts and the correlation with clinical development indicates its usefulness both in the diagnostic work-up of women with adnexal ovarian mass and for predicting their clinical course.

15.
Infect Agent Cancer ; 17(1): 42, 2022 Aug 03.
Artigo em Inglês | MEDLINE | ID: mdl-35922825

RESUMO

PURPOSE: Cervical cancer prevention for older women can be challenging since there are no specific guidelines for this group. This study aimed to determine the incidence of oncogenic HPV and HPV-related dysplasia in elderly women 5 years after being HPV negative. METHODS: Invited women participated five years earlier in a study where self-sampling for HPV testing was applied, at this time, they were all HPV negative. The women were now, five years later invited to perform self-sampling for HPV testing. Women with a positive result performed a repeat HPV test. Those with a positive repeat HPV test were examined by colposcopy, biopsy and cytology. RESULTS: Of the 804 invited women, 634 (76.9%) agreed to participate in the study and a self-sampling kit was sent to them. Of these, 99.6% (632/634) sent a sample to the HPV laboratory. The participation rate in each age group was 93.3% at age 65, 74.0% at age 70, 80.7% at age 75 and 64.6% at age 80. Overall 18 women (2.8%, 95% CI 3.2 to 6.0) were HPV positive in the first test and 8 (1.3%, 95% CI 0.6 to 2.6) in the second test. Sampling for the second test was done on average 5.4 months after the first test. Fifty per cent (4/8) of the women with a positive repeat test had dysplasia in histology. CONCLUSION: The incidence of HPV in previously HPV-negative elderly women was low. Among women who were HPV positive in a repeat test, there was a high prevalence of low grade dysplasia.

16.
Nat Commun ; 13(1): 2532, 2022 05 09.
Artigo em Inglês | MEDLINE | ID: mdl-35534486

RESUMO

Despite the success of genome-wide association studies, much of the genetic contribution to complex traits remains unexplained. Here, we analyse high coverage whole-genome sequencing data, to evaluate the contribution of rare genetic variants to 414 plasma proteins. The frequency distribution of genetic variants is skewed towards the rare spectrum, and damaging variants are more often rare. We estimate that less than 4.3% of the narrow-sense heritability is expected to be explained by rare variants in our cohort. Using a gene-based approach, we identify Cis-associations for 237 of the proteins, which is slightly more compared to a GWAS (N = 213), and we identify 34 associated loci in Trans. Several associations are driven by rare variants, which have larger effects, on average. We therefore conclude that rare variants could be of importance for precision medicine applications, but have a more limited contribution to the missing heritability of complex diseases.


Assuntos
Estudo de Associação Genômica Ampla , Herança Multifatorial , Proteínas Sanguíneas/genética , Predisposição Genética para Doença , Variação Genética , Humanos , Herança Multifatorial/genética , Polimorfismo de Nucleotídeo Único , Sequenciamento Completo do Genoma
17.
Cancers (Basel) ; 14(7)2022 Mar 30.
Artigo em Inglês | MEDLINE | ID: mdl-35406529

RESUMO

BACKGROUND: Ovarian cancer is the eighth most common cancer among women and has a 5-year survival of only 30-50%. The survival is close to 90% for patients in stage I but only 20% for patients in stage IV. The presently available biomarkers have insufficient sensitivity and specificity for early detection and there is an urgent need to identify novel biomarkers. METHODS: We employed the Explore PEA technology for high-precision analysis of 1463 plasma proteins and conducted a discovery and replication study using two clinical cohorts of previously untreated patients with benign or malignant ovarian tumours (N = 111 and N = 37). RESULTS: The discovery analysis identified 32 proteins that had significantly higher levels in malignant cases as compared to benign diagnoses, and for 28 of these, the association was replicated in the second cohort. Multivariate modelling identified three highly accurate models based on 4 to 7 proteins each for separating benign tumours from early-stage and/or late-stage ovarian cancers, all with AUCs above 0.96 in the replication cohort. We also developed a model for separating the early-stage from the late-stage achieving an AUC of 0.81 in the replication cohort. These models were based on eleven proteins in total (ALPP, CXCL8, DPY30, IL6, IL12, KRT19, PAEP, TSPAN1, SIGLEC5, VTCN1, and WFDC2), notably without MUCIN-16. The majority of the associated proteins have been connected to ovarian cancer but not identified as potential biomarkers. CONCLUSIONS: The results show the ability of using high-precision proteomics for the identification of novel plasma protein biomarker candidates for the early detection of ovarian cancer.

18.
Circulation ; 145(18): 1398-1411, 2022 05 03.
Artigo em Inglês | MEDLINE | ID: mdl-35387486

RESUMO

BACKGROUND: SARS-CoV-2, the causal agent of COVID-19, enters human cells using the ACE2 (angiotensin-converting enzyme 2) protein as a receptor. ACE2 is thus key to the infection and treatment of the coronavirus. ACE2 is highly expressed in the heart and respiratory and gastrointestinal tracts, playing important regulatory roles in the cardiovascular and other biological systems. However, the genetic basis of the ACE2 protein levels is not well understood. METHODS: We have conducted the largest genome-wide association meta-analysis of plasma ACE2 levels in >28 000 individuals of the SCALLOP Consortium (Systematic and Combined Analysis of Olink Proteins). We summarize the cross-sectional epidemiological correlates of circulating ACE2. Using the summary statistics-based high-definition likelihood method, we estimate relevant genetic correlations with cardiometabolic phenotypes, COVID-19, and other human complex traits and diseases. We perform causal inference of soluble ACE2 on vascular disease outcomes and COVID-19 severity using mendelian randomization. We also perform in silico functional analysis by integrating with other types of omics data. RESULTS: We identified 10 loci, including 8 novel, capturing 30% of the heritability of the protein. We detected that plasma ACE2 was genetically correlated with vascular diseases, severe COVID-19, and a wide range of human complex diseases and medications. An X-chromosome cis-protein quantitative trait loci-based mendelian randomization analysis suggested a causal effect of elevated ACE2 levels on COVID-19 severity (odds ratio, 1.63 [95% CI, 1.10-2.42]; P=0.01), hospitalization (odds ratio, 1.52 [95% CI, 1.05-2.21]; P=0.03), and infection (odds ratio, 1.60 [95% CI, 1.08-2.37]; P=0.02). Tissue- and cell type-specific transcriptomic and epigenomic analysis revealed that the ACE2 regulatory variants were enriched for DNA methylation sites in blood immune cells. CONCLUSIONS: Human plasma ACE2 shares a genetic basis with cardiovascular disease, COVID-19, and other related diseases. The genetic architecture of the ACE2 protein is mapped, providing a useful resource for further biological and clinical studies on this coronavirus receptor.


Assuntos
Enzima de Conversão de Angiotensina 2 , COVID-19 , Enzima de Conversão de Angiotensina 2/genética , COVID-19/genética , Estudos Transversais , Estudo de Associação Genômica Ampla , Humanos , Receptores de Coronavírus , SARS-CoV-2
19.
Nat Commun ; 13(1): 627, 2022 02 02.
Artigo em Inglês | MEDLINE | ID: mdl-35110541

RESUMO

CRISPR-Cas9 genome editing has potential to cure diseases without current treatments, but therapies must be safe. Here we show that CRISPR-Cas9 editing can introduce unintended mutations in vivo, which are passed on to the next generation. By editing fertilized zebrafish eggs using four guide RNAs selected for off-target activity in vitro, followed by long-read sequencing of DNA from >1100 larvae, juvenile and adult fish across two generations, we find that structural variants (SVs), i.e., insertions and deletions ≥50 bp, represent 6% of editing outcomes in founder larvae. These SVs occur both at on-target and off-target sites. Our results also illustrate that adult founder zebrafish are mosaic in their germ cells, and that 26% of their offspring carries an off-target mutation and 9% an SV. Hence, pre-testing for off-target activity and SVs using patient material is advisable in clinical applications, to reduce the risk of unanticipated effects with potentially large implications.


Assuntos
Sistemas CRISPR-Cas , Edição de Genes/métodos , Peixe-Zebra/genética , Animais , DNA , Terapia Genética , Células Germinativas , Humanos , Mutação , RNA Guia de Cinetoplastídeos/genética
20.
Int J Mol Sci ; 23(4)2022 Feb 17.
Artigo em Inglês | MEDLINE | ID: mdl-35216360

RESUMO

Whole-genome sequencing (WGS) data present a readily available resource for mitochondrial genome (mitogenome) haplotypes that can be utilized for genetics research including population studies. However, the reconstruction of the mitogenome is complicated by nuclear mitochondrial DNA (mtDNA) segments (NUMTs) that co-align with the mtDNA sequences and mimic authentic heteroplasmy. Two minimum variant detection thresholds, 5% and 10%, were assessed for the ability to produce authentic mitogenome haplotypes from a previously generated WGS dataset. Variants associated with NUMTs were detected in the mtDNA alignments for 91 of 917 (~8%) Swedish samples when the 5% frequency threshold was applied. The 413 observed NUMT variants were predominantly detected in two regions (nps 12,612-13,105 and 16,390-16,527), which were consistent with previously documented NUMTs. The number of NUMT variants was reduced by ~97% (400) using a 10% frequency threshold. Furthermore, the 5% frequency data were inconsistent with a platinum-quality mitogenome dataset with respect to observed heteroplasmy. These analyses illustrate that a 10% variant detection threshold may be necessary to ensure the generation of reliable mitogenome haplotypes from WGS data resources.


Assuntos
DNA Mitocondrial/genética , Genoma Mitocondrial/genética , Haplótipos/genética , Mitocôndrias/genética , Núcleo Celular/genética , Humanos , Sequenciamento Completo do Genoma/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...