Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
1.
Am J Hum Genet ; 111(8): 1524-1543, 2024 Aug 08.
Article in English | MEDLINE | ID: mdl-39053458

ABSTRACT

Gene misexpression is the aberrant transcription of a gene in a context where it is usually inactive. Despite its known pathological consequences in specific rare diseases, we have a limited understanding of its wider prevalence and mechanisms in humans. To address this, we analyzed gene misexpression in 4,568 whole-blood bulk RNA sequencing samples from INTERVAL study blood donors. We found that while individual misexpression events occur rarely, in aggregate they were found in almost all samples and a third of inactive protein-coding genes. Using 2,821 paired whole-genome and RNA sequencing samples, we identified that misexpression events are enriched in cis for rare structural variants. We established putative mechanisms through which a subset of SVs lead to gene misexpression, including transcriptional readthrough, transcript fusions, and gene inversion. Overall, we develop misexpression as a type of transcriptomic outlier analysis and extend our understanding of the variety of mechanisms by which genetic variants can influence gene expression.


Subject(s)
Gene Expression Regulation , Humans , Sequence Analysis, RNA , Genetic Variation , Genomic Structural Variation/genetics , Transcriptome/genetics , Blood Donors
2.
bioRxiv ; 2024 Jun 13.
Article in English | MEDLINE | ID: mdl-38370750

ABSTRACT

The adoption of agriculture, first documented ~12,000 years ago in the Fertile Crescent, triggered a rapid shift toward starch-rich diets in human populations. Amylase genes facilitate starch digestion and increased salivary amylase copy number has been observed in some modern human populations with high starch intake, though evidence of recent selection is lacking. Here, using 52 long-read diploid assemblies and short read data from ~5,600 contemporary and ancient humans, we resolve the diversity, evolutionary history, and selective impact of structural variation at the amylase locus. We find that amylase genes have higher copy numbers in populations with agricultural subsistence compared to fishing, hunting, and pastoral groups. We identify 28 distinct amylase structural architectures and demonstrate that nearly identical structures have arisen recurrently on different haplotype backgrounds throughout recent human history. AMY1 and AMY2A genes each exhibit multiple duplications/deletions with mutation rates >10,000-fold the SNP mutation rate, whereas AMY2B gene duplications share a single origin. Using a pangenome graph-based approach to infer structural haplotypes across thousands of humans, we identify extensively duplicated haplotypes present at higher frequencies in modern day populations with traditionally agricultural diets. Leveraging 533 ancient human genomes we find that duplication-containing haplotypes (i.e. haplotypes with more amylase gene copies than the ancestral haplotype) have increased in frequency more than seven-fold over the last 12,000 years providing evidence for recent selection in West Eurasians. Together, our study highlights the potential impacts of the agricultural revolution on human genomes and the importance of long-read sequencing in identifying signatures of selection at structurally complex loci.

3.
Nat Genet ; 56(2): 273-280, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38233595

ABSTRACT

Myeloproliferative neoplasms (MPNs) are chronic cancers characterized by overproduction of mature blood cells. Their causative somatic mutations, for example, JAK2V617F, are common in the population, yet only a minority of carriers develop MPN. Here we show that the inherited polygenic loci that underlie common hematological traits influence JAK2V617F clonal expansion. We identify polygenic risk scores (PGSs) for monocyte count and plateletcrit as new risk factors for JAK2V617F positivity. PGSs for several hematological traits influenced the risk of different MPN subtypes, with low PGSs for two platelet traits also showing protective effects in JAK2V617F carriers, making them two to three times less likely to have essential thrombocythemia than carriers with high PGSs. We observed that extreme hematological PGSs may contribute to an MPN diagnosis in the absence of somatic driver mutations. Our study showcases how polygenic backgrounds underlying common hematological traits influence both clonal selection on somatic mutations and the subsequent phenotype of cancer.


Subject(s)
Myeloproliferative Disorders , Neoplasms , Humans , Mutation , Myeloproliferative Disorders/genetics , Myeloproliferative Disorders/diagnosis , Phenotype , Janus Kinase 2/genetics , Genetic Risk Score
4.
Cell Genom ; 4(7): 100587, 2024 Jul 10.
Article in English | MEDLINE | ID: mdl-38897207

ABSTRACT

Sepsis is a clinical syndrome of life-threatening organ dysfunction caused by a dysregulated response to infection, for which disease heterogeneity is a major obstacle to developing targeted treatments. We have previously identified gene-expression-based patient subgroups (sepsis response signatures [SRS]) informative for outcome and underlying pathophysiology. Here, we aimed to investigate the role of genetic variation in determining the host transcriptomic response and to delineate regulatory networks underlying SRS. Using genotyping and RNA-sequencing data on 638 adult sepsis patients, we report 16,049 independent expression (eQTLs) and 32 co-expression module (modQTLs) quantitative trait loci in this disease context. We identified significant interactions between SRS and genotype for 1,578 SNP-gene pairs and combined transcription factor (TF) binding site information (SNP2TFBS) and predicted regulon activity (DoRothEA) to identify candidate upstream regulators. Overall, these approaches identified putative mechanistic links between host genetic variation, cell subtypes, and the individual transcriptomic response to infection.


Subject(s)
Gene Regulatory Networks , Polymorphism, Single Nucleotide , Quantitative Trait Loci , Sepsis , Humans , Sepsis/genetics , Gene Regulatory Networks/genetics , Transcription Factors/genetics , Transcription Factors/metabolism , Male , Female , Transcriptome , Middle Aged , Adult , Genotype
5.
Nat Genet ; 56(6): 1090-1099, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38839884

ABSTRACT

Restless legs syndrome (RLS) affects up to 10% of older adults. Their healthcare is impeded by delayed diagnosis and insufficient treatment. To advance disease prediction and find new entry points for therapy, we performed meta-analyses of genome-wide association studies in 116,647 individuals with RLS (cases) and 1,546,466 controls of European ancestry. The pooled analysis increased the number of risk loci eightfold to 164, including three on chromosome X. Sex-specific meta-analyses revealed largely overlapping genetic predispositions of the sexes (rg = 0.96). Locus annotation prioritized druggable genes such as glutamate receptors 1 and 4, and Mendelian randomization indicated RLS as a causal risk factor for diabetes. Machine learning approaches combining genetic and nongenetic information performed best in risk prediction (area under the curve (AUC) = 0.82-0.91). In summary, we identified targets for drug development and repurposing, prioritized potential causal relationships between RLS and relevant comorbidities and risk factors for follow-up and provided evidence that nonlinear interactions are likely relevant to RLS risk prediction.


Subject(s)
Genetic Predisposition to Disease , Genome-Wide Association Study , Restless Legs Syndrome , Restless Legs Syndrome/genetics , Humans , Risk Factors , Female , Male , Polymorphism, Single Nucleotide , Mendelian Randomization Analysis , Machine Learning
SELECTION OF CITATIONS
SEARCH DETAIL