Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 716
Filtrar
Más filtros

Intervalo de año de publicación
1.
Am J Hum Genet ; 111(4): 654-667, 2024 04 04.
Artículo en Inglés | MEDLINE | ID: mdl-38471507

RESUMEN

Allele-specific methylation (ASM) is an epigenetic modification whereby one parental allele becomes methylated and the other unmethylated at a specific locus. ASM is most often driven by the presence of nearby heterozygous variants that influence methylation, but also occurs somatically in the context of genomic imprinting. In this study, we investigate ASM using publicly available single-cell reduced representation bisulfite sequencing (scRRBS) data on 608 B cells sampled from six healthy B cell samples and 1,230 cells from 11 chronic lymphocytic leukemia (CLL) samples. We developed a likelihood-based criterion to test whether a CpG exhibited ASM, based on the distributions of methylated and unmethylated reads both within and across cells. Applying our likelihood ratio test, 65,998 CpG sites exhibited ASM in healthy B cell samples according to a Bonferroni criterion (p < 8.4 × 10-9), and 32,862 CpG sites exhibited ASM in CLL samples (p < 8.5 × 10-9). We also called ASM at the sample level. To evaluate the accuracy of our method, we called heterozygous variants from the scRRBS data, which enabled variant-based calls of ASM within each cell. Comparing sample-level ASM calls to the variant-based measures of ASM, we observed a positive predictive value of 76%-100% across samples. We observed high concordance of ASM across samples and an overrepresentation of ASM in previously reported imprinted genes and genes with imprinting binding motifs. Our study demonstrates that single-cell bisulfite sequencing is a potentially powerful tool to investigate ASM, especially as studies expand to increase the number of samples and cells sequenced.


Asunto(s)
Metilación de ADN , Leucemia Linfocítica Crónica de Células B , Sulfitos , Humanos , Metilación de ADN/genética , Alelos , Leucemia Linfocítica Crónica de Células B/genética , Funciones de Verosimilitud , Impresión Genómica/genética , Islas de CpG/genética
2.
Am J Hum Genet ; 111(8): 1770-1781, 2024 Aug 08.
Artículo en Inglés | MEDLINE | ID: mdl-39047729

RESUMEN

Allele-specific expression plays a crucial role in unraveling various biological mechanisms, including genomic imprinting and gene expression controlled by cis-regulatory variants. However, existing methods for quantification from RNA-sequencing (RNA-seq) reads do not adequately and efficiently remove various allele-specific read mapping biases, such as reference bias arising from reads containing the alternative allele that do not map to the reference transcriptome or ambiguous mapping bias caused by reads containing the reference allele that map differently from reads containing the alternative allele. We present Ornaments, a computational tool for rapid and accurate estimation of allele-specific transcript expression at unphased heterozygous loci from RNA-seq reads while correcting for allele-specific read mapping biases. Ornaments removes reference bias by mapping reads to a personalized transcriptome and ambiguous mapping bias by probabilistically assigning reads to multiple transcripts and variant loci they map to. Ornaments is a lightweight extension of kallisto, a popular tool for fast RNA-seq quantification, that improves the efficiency and accuracy of WASP, a popular tool for bias correction in allele-specific read mapping. In experiments with simulated and human lymphoblastoid cell-line RNA-seq reads with the genomes of the 1000 Genomes Project, we demonstrate that Ornaments improves the accuracy of WASP and kallisto, is nearly as efficient as kallisto, and is an order of magnitude faster than WASP per sample, with the additional cost of constructing a personalized index for multiple samples. Additionally, we show that Ornaments finds imprinted transcripts with higher sensitivity than WASP, which detects imprinted signals only at gene level.


Asunto(s)
Alelos , Humanos , Transcriptoma/genética , Impresión Genómica , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Perfilación de la Expresión Génica/métodos
3.
Brief Bioinform ; 25(2)2024 Jan 22.
Artículo en Inglés | MEDLINE | ID: mdl-38517695

RESUMEN

Given the universality of autopolyploid species in nature, it is crucial to develop genomic selection methods that consider different allele dosages for autopolyploid breeding. However, no method has been developed to deal with autopolyploid data regardless of the ploidy level. In this study, we developed a modified genomic best linear unbiased prediction (GBLUP) model (polyGBLUP) through constructing additive and dominant genomic relationship matrices based on different allele dosages. polyGBLUP could carry out genomic prediction for autopolyploid species regardless of the ploidy level. Through comprehensive simulations and analysis of real data of autotetraploid blueberry and guinea grass and autohexaploid sweet potato, the results showed that polyGBLUP achieved higher prediction accuracy than GBLUP and its superiority was more obvious when the ploidy level of autopolyploids is high. Furthermore, when the dominant effect was added to polyGBLUP (polyGDBLUP), the greater the dominance degree, the more obvious the advantages of polyGDBLUP over the diploid models in terms of prediction accuracy, bias, mean squared error and mean absolute error. For real data, the superiority of polyGBLUP over GBLUP appeared in blueberry and sweet potato populations and a part of the traits in guinea grass population due to the high correlation coefficients between diploid and polyploidy genomic relationship matrices. In addition, polyGDBLUP did not produce higher prediction accuracy than polyGBLUP for most traits of real data as dominant genetic variance was not captured for these traits. Our study will be a significant promising method for genomic prediction of autopolyploid species.


Asunto(s)
Genoma , Genómica , Humanos , Genómica/métodos , Fenotipo , Ploidias , Poliploidía , Modelos Genéticos , Genotipo , Polimorfismo de Nucleótido Simple
4.
Brief Bioinform ; 25(4)2024 May 23.
Artículo en Inglés | MEDLINE | ID: mdl-38920343

RESUMEN

While significant strides have been made in predicting neoepitopes that trigger autologous CD4+ T cell responses, accurately identifying the antigen presentation by human leukocyte antigen (HLA) class II molecules remains a challenge. This identification is critical for developing vaccines and cancer immunotherapies. Current prediction methods are limited, primarily due to a lack of high-quality training epitope datasets and algorithmic constraints. To predict the exogenous HLA class II-restricted peptides across most of the human population, we utilized the mass spectrometry data to profile >223 000 eluted ligands over HLA-DR, -DQ, and -DP alleles. Here, by integrating these data with peptide processing and gene expression, we introduce HLAIImaster, an attention-based deep learning framework with adaptive domain knowledge for predicting neoepitope immunogenicity. Leveraging diverse biological characteristics and our enhanced deep learning framework, HLAIImaster is significantly improved against existing tools in terms of positive predictive value across various neoantigen studies. Robust domain knowledge learning accurately identifies neoepitope immunogenicity, bridging the gap between neoantigen biology and the clinical setting and paving the way for future neoantigen-based therapies to provide greater clinical benefit. In summary, we present a comprehensive exploitation of the immunogenic neoepitope repertoire of cancers, facilitating the effective development of "just-in-time" personalized vaccines.


Asunto(s)
Aprendizaje Profundo , Antígenos de Histocompatibilidad Clase II , Humanos , Antígenos de Histocompatibilidad Clase II/inmunología , Epítopos/inmunología , Biología Computacional/métodos , Epítopos de Linfocito T/inmunología
5.
J Biol Chem ; 300(7): 107443, 2024 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-38838773

RESUMEN

Functional variants of the gene for the cytokine macrophage migration inhibitory factor (MIF) are defined by a 4-nucleotide promoter microsatellite (-794 CATT5-8, rs5844572) and confer risk for autoimmune, infectious, and oncologic diseases. We describe herein the discovery of a prototypic, small molecule inhibitor of MIF transcription with selectivity for high microsatellite repeat number and correspondingly high gene expression. Utilizing a high-throughput luminescent proximity screen, we identify 1-carbomethoxy-5-formyl-4,6,8-trihydroxyphenazine (CMFT) to inhibit the functional interaction between the transcription factor ICBP90 (namely, UHRF1) and the MIF -794 CATT5-8 promoter microsatellite. CMFT inhibits MIF mRNA expression in a -794 CATT5-8 length-dependent manner with an IC50 of 470 nM, and preferentially reduces ICBP90-dependent MIF mRNA and protein expression in high-genotypic versus low-genotypic MIF-expressing macrophages. RNA expression analysis also showed CMFT to downregulate MIF-dependent, inflammatory gene expression with little evidence of off-target metabolic toxicity. These findings provide proof-of-concept for advancing the pharmacogenomic development of precision-based MIF inhibitors for diverse autoimmune and inflammatory conditions.


Asunto(s)
Oxidorreductasas Intramoleculares , Factores Inhibidores de la Migración de Macrófagos , Factores Inhibidores de la Migración de Macrófagos/genética , Factores Inhibidores de la Migración de Macrófagos/antagonistas & inhibidores , Factores Inhibidores de la Migración de Macrófagos/metabolismo , Factores Inhibidores de la Migración de Macrófagos/inmunología , Humanos , Oxidorreductasas Intramoleculares/genética , Oxidorreductasas Intramoleculares/antagonistas & inhibidores , Oxidorreductasas Intramoleculares/metabolismo , Alelos , Repeticiones de Microsatélite , Regiones Promotoras Genéticas , Animales , Macrófagos/metabolismo , Macrófagos/inmunología , Macrófagos/efectos de los fármacos , Transcripción Genética/efectos de los fármacos , Ratones , Proteínas Potenciadoras de Unión a CCAAT/genética , Proteínas Potenciadoras de Unión a CCAAT/metabolismo
6.
Plant J ; 2024 Jul 08.
Artículo en Inglés | MEDLINE | ID: mdl-38976378

RESUMEN

The utilization of rice heterosis is essential for ensuring global food security; however, its molecular mechanism remains unclear. In this study, comprehensive analyses of accessible chromatin regions (ACRs), DNA methylation, and gene expression in inter-subspecific hybrid and its parents were performed to determine the potential role of chromatin accessibility in rice heterosis. The hybrid exhibited abundant ACRs, in which the gene ACRs and proximal ACRs were directly related to transcriptional activation rather than the distal ACRs. Regarding the dynamic accessibility contribution of the parents, paternal ZHF1015 transmitted a greater number of ACRs to the hybrid. Accessible genotype-specific target genes were enriched with overrepresented transcription factors, indicating a unique regulatory network of genes in the hybrid. Compared with its parents, the differentially accessible chromatin regions with upregulated chromatin accessibility were much greater than those with downregulated chromatin accessibility, reflecting a stronger regulation in the hybrid. Furthermore, DNA methylation levels were negatively correlated with ACR intensity, and genes were strongly affected by CHH methylation in the hybrid. Chromatin accessibility positively regulated the overall expression level of each genotype. ACR-related genes with maternal Z04A-bias allele-specific expression tended to be enriched during carotenoid biosynthesis, whereas paternal ZHF1015-bias genes were more active in carbohydrate metabolism. Our findings provide a new perspective on the mechanism of heterosis based on chromatin accessibility in inter-subspecific hybrid rice.

7.
Mol Biol Evol ; 41(2)2024 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-38410843

RESUMEN

In the African weakly electric fish genus Campylomormyrus, electric organ discharge signals are strikingly different in shape and duration among closely related species, contribute to prezygotic isolation, and may have triggered an adaptive radiation. We performed mRNA sequencing on electric organs and skeletal muscles (from which the electric organs derive) from 3 species with short (0.4 ms), medium (5 ms), and long (40 ms) electric organ discharges and 2 different cross-species hybrids. We identified 1,444 upregulated genes in electric organ shared by all 5 species/hybrid cohorts, rendering them candidate genes for electric organ-specific properties in Campylomormyrus. We further identified several candidate genes, including KCNJ2 and KLF5, and their upregulation may contribute to increased electric organ discharge duration. Hybrids between a short (Campylomormyrus compressirostris) and a long (Campylomormyrus rhynchophorus) discharging species exhibit electric organ discharges of intermediate duration and showed imbalanced expression of KCNJ2 alleles, pointing toward a cis-regulatory difference at this locus, relative to electric organ discharge duration. KLF5 is a transcription factor potentially balancing potassium channel gene expression, a crucial process for the formation of an electric organ discharge. Unraveling the genetic basis of the species-specific modulation of the electric organ discharge in Campylomormyrus is crucial for understanding the adaptive radiation of this emerging model taxon of ecological (perhaps even sympatric) speciation.


Asunto(s)
Pez Eléctrico , Animales , Pez Eléctrico/genética , Alelos , Órgano Eléctrico/metabolismo , Regulación hacia Arriba , Canales de Potasio/genética
8.
Mol Biol Evol ; 41(5)2024 May 03.
Artículo en Inglés | MEDLINE | ID: mdl-38577958

RESUMEN

Estimating the distribution of fitness effects (DFE) of new mutations is of fundamental importance in evolutionary biology, ecology, and conservation. However, existing methods for DFE estimation suffer from limitations, such as slow computation speed and limited scalability. To address these issues, we introduce fastDFE, a Python-based software package, offering fast, and flexible DFE inference from site-frequency spectrum (SFS) data. Apart from providing efficient joint inference of multiple DFEs that share parameters, it offers the feature of introducing genomic covariates that influence the DFEs and testing their significance. To further simplify usage, fastDFE is equipped with comprehensive VCF-to-SFS parsing utilities. These include options for site filtering and stratification, as well as site-degeneracy annotation and probabilistic ancestral-allele inference. fastDFE thereby covers the entire workflow of DFE inference from the moment of acquiring a raw VCF file. Despite its Python foundation, fastDFE incorporates a full R interface, including native R visualization capabilities. The package is comprehensively tested and documented at fastdfe.readthedocs.io.


Asunto(s)
Aptitud Genética , Programas Informáticos , Mutación , Modelos Genéticos
9.
Mol Biol Evol ; 41(7)2024 Jul 03.
Artículo en Inglés | MEDLINE | ID: mdl-38934796

RESUMEN

Plant cells harbor two membrane-bound organelles containing their own genetic material-plastids and mitochondria. Although the two organelles coexist and coevolve within the same plant cells, they differ in genome copy number, intracellular organization, and mode of segregation. How these attributes affect the time to fixation or, conversely, loss of neutral alleles is currently unresolved. Here, we show that mitochondria and plastids share the same mutation rate, yet plastid alleles remain in a heteroplasmic state significantly longer compared with mitochondrial alleles. By analyzing genetic variants across populations of the marine flowering plant Zostera marina and simulating organelle allele dynamics, we examine the determinants of allele segregation and allele fixation. Our results suggest that the bottlenecks on the cell population, e.g. during branching or seeding, and stratification of the meristematic tissue are important determinants of mitochondrial allele dynamics. Furthermore, we suggest that the prolonged plastid allele dynamics are due to a yet unknown active plastid partition mechanism. The dissimilarity between plastid and mitochondrial novel allele fixation at different levels of organization may manifest in differences in adaptation processes. Our study uncovers fundamental principles of organelle population genetics that are essential for further investigations of long-term evolution and molecular dating of divergence events.


Asunto(s)
Heteroplasmia , Mitocondrias , Tasa de Mutación , Plastidios , Plastidios/genética , Mitocondrias/genética , Mitocondrias/metabolismo , Alelos
10.
Hum Genomics ; 18(1): 52, 2024 May 24.
Artículo en Inglés | MEDLINE | ID: mdl-38790075

RESUMEN

The recent article by Harit et al. in Human Genomics reported a novel association of the C allele of rs479200 in the human EGLN1 gene with severe COVID-19 in Indian patients. The gene in context is an oxygen-sensor gene whose T allele has been reported to contribute to the inability to cope with hypoxia due to increased expression of the EGLN1 gene and therefore persons with TT genotype of EGLN1 rs479200 are more susceptible to severe manifestations of hypoxia. In contrast to this dogma, Harit et al. showed that the C allele is associated with the worsening of COVID-19 hypoxia without suggesting or even discussing the scientific plausibility of the association. The article also suffers from certain epidemiological, statistical, and mathematical issues that need to be critically elaborated and discussed. In this context, the findings of Harit et al. may be re-evaluated.


Asunto(s)
COVID-19 , Predisposición Genética a la Enfermedad , Prolina Dioxigenasas del Factor Inducible por Hipoxia , SARS-CoV-2 , Humanos , Alelos , COVID-19/genética , COVID-19/epidemiología , COVID-19/virología , Genotipo , Hipoxia/genética , Prolina Dioxigenasas del Factor Inducible por Hipoxia/genética , India/epidemiología , Polimorfismo de Nucleótido Simple/genética , SARS-CoV-2/genética , SARS-CoV-2/patogenicidad , Índice de Severidad de la Enfermedad
11.
Hum Genomics ; 18(1): 40, 2024 Apr 22.
Artículo en Inglés | MEDLINE | ID: mdl-38650020

RESUMEN

BACKGROUND: CYP2C8 is responsible for the metabolism of 5% of clinically prescribed drugs, including antimalarials, anti-cancer and anti-inflammatory drugs. Genetic variability is an important factor that influences CYP2C8 activity and modulates the pharmacokinetics, efficacy and safety of its substrates. RESULTS: We profiled the genetic landscape of CYP2C8 variability using data from 96 original studies and data repositories that included a total of 33,185 unrelated participants across 44 countries and 43 ethnic groups. The reduced function allele CYP2C8*2 was most common in West and Central Africa with frequencies of 16-36.9%, whereas it was rare in Europe and Asia (< 2%). In contrast, CYP2C8*3 and CYP2C8*4 were common throughout Europe and the Americas (6.9-19.8% for *3 and 2.3-7.5% for *4), but rare in African and East Asian populations. Importantly, we observe pronounced differences (> 2.3-fold) between neighboring countries and even between geographically overlapping populations. Overall, we found that 20-60% of individuals in Africa and Europe carry at least one CYP2C8 allele associated with reduced metabolism and increased adverse event risk of the anti-malarial amodiaquine. Furthermore, up to 60% of individuals of West African ancestry harbored variants that reduced the clearance of pioglitazone, repaglinide, paclitaxel and ibuprofen. In contrast, reduced function alleles are only found in < 2% of East Asian and 8.3-12.8% of South and West Asian individuals. CONCLUSIONS: Combined, the presented analyses mapped the genetic and inferred functional variability of CYP2C8 with high ethnogeographic resolution. These results can serve as a valuable resource for CYP2C8 allele frequencies and distribution estimates of CYP2C8 phenotypes that could help identify populations at risk upon treatment with CYP2C8 substrates. The high variability between ethnic groups incentivizes high-resolution pharmacogenetic profiling to guide precision medicine and maximize its socioeconomic benefits, particularly for understudied populations with distinct genetic profiles.


Asunto(s)
Alelos , Carbamatos , Citocromo P-450 CYP2C8 , Piperidinas , Citocromo P-450 CYP2C8/genética , Humanos , Frecuencia de los Genes/genética , Polimorfismo de Nucleótido Simple/genética , Europa (Continente) , Tiazolidinedionas/efectos adversos
12.
BMC Bioinformatics ; 25(1): 109, 2024 Mar 12.
Artículo en Inglés | MEDLINE | ID: mdl-38475727

RESUMEN

BACKGROUND: Parent-of-origin allele-specific gene expression (ASE) can be detected in interspecies hybrids by virtue of RNA sequence variants between the parental haplotypes. ASE is detectable by differential expression analysis (DEA) applied to the counts of RNA-seq read pairs aligned to parental references, but aligners do not always choose the correct parental reference. RESULTS: We used public data for species that are known to hybridize. We measured our ability to assign RNA-seq read pairs to their proper transcriptome or genome references. We tested software packages that assign each read pair to a reference position and found that they often favored the incorrect species reference. To address this problem, we introduce a post process that extracts alignment features and trains a random forest classifier to choose the better alignment. On each simulated hybrid dataset tested, our machine-learning post-processor achieved higher accuracy than the aligner by itself at choosing the correct parent-of-origin per RNA-seq read pair. CONCLUSIONS: For the parent-of-origin classification of RNA-seq, machine learning can improve the accuracy of alignment-based methods. This approach could be useful for enhancing ASE detection in interspecies hybrids, though RNA-seq from real hybrids may present challenges not captured by our simulations. We believe this is the first application of machine learning to this problem domain.


Asunto(s)
Programas Informáticos , Transcriptoma , RNA-Seq , Análisis de Secuencia de ARN/métodos , Aprendizaje Automático
13.
BMC Bioinformatics ; 25(1): 91, 2024 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-38429654

RESUMEN

BACKGROUND: Uncovering functional genetic variants from an allele-specific perspective is of paramount importance in advancing our understanding of gene regulation and genetic diseases. Recently, various allele-specific events, such as allele-specific gene expression, allele-specific methylation, and allele-specific binding, have been explored on a genome-wide scale due to the development of high-throughput sequencing methods. RNA secondary structure, which plays a crucial role in multiple RNA-associated processes like RNA modification, translation and splicing, has emerged as an essential focus of relevant research. However, tools to identify genetic variants associated with allele-specific RNA secondary structures are still lacking. RESULTS: Here, we develop a computational tool called 'AStruct' that enables us to detect allele-specific RNA secondary structure (ASRS) from RT-stop based structuromic probing data. AStruct shows robust performance in both simulated datasets and public icSHAPE datasets. We reveal that single nucleotide polymorphisms (SNPs) with higher AStruct scores are enriched in coding regions and tend to be functional. These SNPs are highly conservative, have the potential to disrupt sites involved in m6A modification or protein binding, and are frequently associated with disease. CONCLUSIONS: AStruct is a tool dedicated to invoke allele-specific RNA secondary structure events at heterozygous SNPs in RT-stop based structuromic probing data. It utilizes allelic variants, base pairing and RT-stop information under different cell conditions to detect dynamic and functional ASRS. Compared to sequence-based tools, AStruct considers dynamic cell conditions and outperforms in detecting functional variants. AStruct is implemented in JAVA and is freely accessible at: https://github.com/canceromics/AStruct .


Asunto(s)
Regulación de la Expresión Génica , ARN , ARN/genética , ARN/química , Alelos , Empalme del ARN , Secuenciación de Nucleótidos de Alto Rendimiento/métodos
14.
BMC Genomics ; 25(1): 476, 2024 May 14.
Artículo en Inglés | MEDLINE | ID: mdl-38745122

RESUMEN

BACKGROUND: Heterosis has successfully enhanced maize productivity and quality. Although significant progress has been made in delineating the genetic basis of heterosis, the molecular mechanisms underlying its genetic components remain less explored. Allele-specific expression (ASE), the imbalanced expression between two parental alleles in hybrids, is increasingly being recognized as a factor contributing to heterosis. ASE is a complex process regulated by both epigenetic and genetic variations in response to developmental and environmental conditions. RESULTS: In this study, we explored the differential characteristics of ASE by analyzing the transcriptome data of two maize hybrids and their parents under four light conditions. On the basis of allele expression patterns in different hybrids under various conditions, ASE genes were divided into three categories: bias-consistent genes involved in basal metabolic processes in a functionally complementary manner, bias-reversal genes adapting to the light environment, and bias-specific genes maintaining cell homeostasis. We observed that 758 ASE genes (ASEGs) were significantly overlapped with heterosis quantitative trait loci (QTLs), and high-frequency variations in the promoter regions of heterosis-related ASEGs were identified between parents. In addition, 10 heterosis-related ASEGs participating in yield heterosis were selected during domestication. CONCLUSIONS: The comprehensive analysis of ASEGs offers a distinctive perspective on how light quality influences gene expression patterns and gene-environment interactions, with implications for the identification of heterosis-related ASEGs to enhance maize yield.


Asunto(s)
Alelos , Regulación de la Expresión Génica de las Plantas , Vigor Híbrido , Regiones Promotoras Genéticas , Sitios de Carácter Cuantitativo , Zea mays , Zea mays/genética , Zea mays/metabolismo , Vigor Híbrido/genética , Perfilación de la Expresión Génica , Variación Genética , Transcriptoma
15.
BMC Genomics ; 25(1): 487, 2024 May 16.
Artículo en Inglés | MEDLINE | ID: mdl-38755557

RESUMEN

BACKGROUND: The identification of low-frequency haplotypes, never observed in homozygous state in a population, is considered informative on the presence of potentially harmful alleles (candidate alleles), putatively involved in inbreeding depression. Although identification of candidate alleles is challenging, studies analyzing the dynamics of potentially harmful alleles are lacking. A pedigree of the highly endangered Gochu Asturcelta pig breed, including 471 individuals belonging to 51 different families with at least 5 offspring each, was genotyped using the Axiom PigHDv1 Array (658,692 SNPs). Analyses were carried out on four different cohorts defined according to pedigree depth and at the whole population (WP) level. RESULTS: The 4,470 Linkage Blocks (LB) identified in the Base Population (10 individuals), gathered a total of 16,981 alleles in the WP. Up to 5,466 (32%) haplotypes were statistically considered candidate alleles, 3,995 of them (73%) having one copy only. The number of alleles and candidate alleles varied across cohorts according to sample size. Up to 4,610 of the alleles identified in the WP (27% of the total) were present in one cohort only. Parentage analysis identified a total of 67,742 parent-offspring incompatibilities. The number of mismatches varied according to family size. Parent-offspring inconsistencies were identified in 98.2% of the candidate alleles and 100% of the LB in which they were located. Segregation analyses informed that most potential candidate alleles appeared de novo in the pedigree. Only 17 candidate alleles were identified in the boar, sow, and paternal and maternal grandparents and were considered segregants. CONCLUSIONS: Our results suggest that neither mutation nor recombination are the major forces causing the apparition of candidate alleles. Their occurrence is more likely caused by Allele-Drop-In events due to SNP calling errors. New alleles appear when wrongly called SNPs are used to construct haplotypes. The presence of candidate alleles in either parents or grandparents of the carrier individuals does not ensure that they are true alleles. Minimum Allele Frequency thresholds may remove informative alleles. Only fully segregant candidate alleles should be considered potentially harmful alleles. A set of 16 candidate genes, potentially involved in inbreeding depression, is described.


Asunto(s)
Alelos , Haplotipos , Linaje , Polimorfismo de Nucleótido Simple , Animales , Porcinos/genética , Dinámica Poblacional , Femenino , Masculino , Frecuencia de los Genes
16.
BMC Genomics ; 25(1): 48, 2024 Jan 10.
Artículo en Inglés | MEDLINE | ID: mdl-38200446

RESUMEN

BACKGROUND: Human mitochondrial heteroplasmy is an extensively investigated phenomenon in the context of medical diagnostics, forensic identification and molecular evolution. However, technical limitations of high-throughput sequencing hinder reliable determination of point heteroplasmies (PHPs) with minor allele frequencies (MAFs) within the noise threshold. RESULTS: To investigate the PHP landscape at an MAF threshold down to 0.1%, we sequenced whole mitochondrial genomes at approximately 7.700x coverage, in multiple technical and biological replicates of longitudinal blood and buccal swab samples from 11 human donors (159 libraries in total). The results obtained by two independent sequencing platforms and bioinformatics pipelines indicate distinctive PHP patterns below and above the 1% MAF cut-off. We found a high inter-individual prevalence of low-level PHPs (MAF < 1%) at polymorphic positions of the mitochondrial DNA control region (CR), their tissue preference, and a tissue-specific minor allele linkage. We also established the position-dependent potential of minor allele expansion in PHPs, and short-term PHP instability in a mitotically active tissue. We demonstrate that the increase in sensitivity of PHP detection to minor allele frequencies below 1% within a robust experimental and analytical pipeline, provides new information with potential applicative value. CONCLUSIONS: Our findings reliably show different mutational loads between tissues at sub-1% allele frequencies, which may serve as an informative medical biomarker of time-dependent, tissue-specific mutational burden, or help discriminate forensically relevant tissues in a single person, close maternal relatives or unrelated individuals of similar phylogenetic background.


Asunto(s)
Heteroplasmia , Mitocondrias , Humanos , Filogenia , Mitocondrias/genética , Secuenciación de Nucleótidos de Alto Rendimiento , ADN Mitocondrial/genética
17.
BMC Genomics ; 25(1): 65, 2024 Jan 16.
Artículo en Inglés | MEDLINE | ID: mdl-38229017

RESUMEN

BACKGROUND: Pod shell thickness (PST) is an important agronomic trait of peanut because it affects the ability of shells to resist pest infestations and pathogen attacks, while also influencing the peanut shelling process. However, very few studies have explored the genetic basis of PST. RESULTS: An F2 segregating population derived from a cross between the thick-shelled cultivar Yueyou 18 (YY18) and the thin-shelled cultivar Weihua 8 (WH8) was used to identify the quantitative trait loci (QTLs) for PST. On the basis of a bulked segregant analysis sequencing (BSA-seq), four QTLs were preliminarily mapped to chromosomes 3, 8, 13, and 18. Using the genome resequencing data of YY18 and WH8, 22 kompetitive allele-specific PCR (KASP) markers were designed for the genotyping of the F2 population. Two major QTLs (qPSTA08 and qPSTA18) were identified and finely mapped, with qPSTA08 detected on chromosome 8 (0.69-Mb physical genomic region) and qPSTA18 detected on chromosome 18 (0.15-Mb physical genomic region). Moreover, qPSTA08 and qPSTA18 explained 31.1-32.3% and 16.7-16.8% of the phenotypic variation, respectively. Fifteen genes were detected in the two candidate regions, including three genes with nonsynonymous mutations in the exon region. Two molecular markers (Tif2_A08_31713024 and Tif2_A18_7198124) that were developed for the two major QTL regions effectively distinguished between thick-shelled and thin-shelled materials. Subsequently, the two markers were validated in four F2:3 lines selected. CONCLUSIONS: The QTLs identified and molecular markers developed in this study may lay the foundation for breeding cultivars with a shell thickness suitable for mechanized peanut shelling.


Asunto(s)
Arachis , Sitios de Carácter Cuantitativo , Arachis/genética , Mapeo Cromosómico , Fitomejoramiento , Fenotipo
18.
Ann Hum Genet ; 2024 Jun 19.
Artículo en Inglés | MEDLINE | ID: mdl-38895879

RESUMEN

INTRODUCTION: Iran, a country in the Middle East, has several ethnic and ethno-religious groups and needs its own ethnic-specific databases for the forensic statistical parameters and allele frequency of STR markers. METHODS: We have investigated 600 unrelated Turk individuals from four northwestern provinces of Iran using the Identifiler™ system (TPOX, FGA, vWA, TH01, CSF1PO, D2S1338, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, D19S433, and D21S11). Furthermore, STR allelic frequencies were compared to previously population-based data. RESULTS AND CONCLUSION: After Bonferroni correction, deviation from Hardy-Weinberg equilibrium (HWE) was observed in the FGA, TPOX, VWA, and D19S433 loci (P value < 0.05). The combined power of discrimination (CPD) and exclusion (CPE) values for all 15 STR loci were 0.9999999999999999999984 and 0.9999999, respectively. In comparison with Azerbaijani and Turkish populations, there were no significant differences on all STR markers. However, in the Chinese Han population, differences at 13 STR loci were detected. Additionally, comparisons of Fischer genetic distance indices (FST) P-values did not reveal any statistically significant difference between Northwestern Iran, Azerbaijan and Iran (Fars) populations. PCA and PCoA analyses showed that our population was grouped with different populations in different quarters, showing a positive and negative correlation, respectively. In the NJ and UPGMA phylogenetic trees, Iranian populations were grouped together. These results demonstrated that the given set of STR markers can be confidently used for all identification tests in Northwestern Iran.

19.
Funct Integr Genomics ; 24(3): 104, 2024 May 20.
Artículo en Inglés | MEDLINE | ID: mdl-38764005

RESUMEN

Accurate estimation of population allele frequency (AF) is crucial for gene discovery and genetic diagnostics. However, determining AF for frameshift-inducing small insertions and deletions (indels) faces challenges due to discrepancies in mapping and variant calling methods. Here, we propose an innovative approach to assess indel AF. We developed CRAFTS-indels (Calculating Regional Allele Frequency Targeting Small indels), an algorithm that combines AF of distinct indels within a given region and provides "regional AF" (rAF). We tested and validated CRAFTS-indels using three independent datasets: gnomAD v2 (n=125,748 samples), an internal dataset (IGM; n=39,367), and the UK BioBank (UKBB; n=469,835). By comparing rAF against standard AF, we identified rare indels with rAF exceeding standard AF (sAF≤10-4 and rAF>10-4) as "rAF-hi" indels. Notably, a high percentage of rare indels were "rAF-hi", with a higher proportion in gnomAD v2 (11-20%) and IGM (11-22%) compared to the UKBB (5-9% depending on the CRAFTS-indels' parameters). Analysis of the overlap of regions based on their rAF with low complexity regions and with ClinVar classification supported the pertinence of rAF. Using the internal dataset, we illustrated the utility of CRAFTS-indel in the analysis of de novo variants and the potential negative impact of rAF-hi indels in gene discovery. In summary, annotation of indels with cohort specific rAF can be used to handle some of the limitations of current annotation pipelines and facilitate detection of novel gene disease associations. CRAFTS-indels offers a user-friendly approach to providing rAF annotation. It can be integrated into public databases such as gnomAD, UKBB and used by ClinVar to revise indel classifications.


Asunto(s)
Frecuencia de los Genes , Mutación INDEL , Humanos , Algoritmos
20.
Am Nat ; 203(2): 292-304, 2024 02.
Artículo en Inglés | MEDLINE | ID: mdl-38306286

RESUMEN

AbstractBiological adaptation is the outcome of allele-frequency change by natural selection. At the same time, populations are usually class structured as individuals occupy different states, such as age, sex, or stage. This is known to result in the differential transmission of alleles through nonheritable fitness differences called class transmission, which also affects allele-frequency change even in the absence of selection. How does one then isolate allele-frequency change due to selection from that due to class transmission? We decompose one-generational allele-frequency change in terms of effects of selection and class transmission and show how reproductive values can be used to reach a decomposition between any two distant generations of the evolutionary process. This provides a missing relationship between multigenerational allele-frequency change and the operation of selection. It also allows a measure of fitness to be defined summarizing the effect of selection in a multigenerational evolutionary process, which connects asymptotically to invasion fitness.


Asunto(s)
Modelos Genéticos , Selección Genética , Humanos , Frecuencia de los Genes , Reproducción , Evolución Biológica
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA