Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters








Database
Language
Publication year range
1.
Nucleic Acids Res ; 51(5): 2377-2396, 2023 03 21.
Article in English | MEDLINE | ID: mdl-36727459

ABSTRACT

Translation is a key determinant of gene expression and an important biotechnological engineering target. In bacteria, 5'-untranslated region (5'-UTR) and coding sequence (CDS) are well-known mRNA parts controlling translation and thus cellular protein levels. However, the complex interaction of 5'-UTR and CDS has so far only been studied for few sequences leading to non-generalisable and partly contradictory conclusions. Herein, we systematically assess the dynamic translation from over 1.2 million 5'-UTR-CDS pairs in Escherichia coli to investigate their collective effect using a new method for ultradeep sequence-function mapping. This allows us to disentangle and precisely quantify effects of various sequence determinants of translation. We find that 5'-UTR and CDS individually account for 53% and 20% of variance in translation, respectively, and show conclusively that, contrary to a common hypothesis, tRNA abundance does not explain expression changes between CDSs with different synonymous codons. Moreover, the obtained large-scale data provide clear experimental evidence for a base-pairing interaction between initiator tRNA and mRNA beyond the anticodon-codon interaction, an effect that is often masked for individual sequences and therefore inaccessible to low-throughput approaches. Our study highlights the indispensability of ultradeep sequence-function mapping to accurately determine the contribution of parts and phenomena involved in gene regulation.


Subject(s)
RNA, Transfer, Met , RNA, Transfer , Base Pairing , RNA, Transfer, Met/genetics , RNA, Transfer, Met/metabolism , Codon/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA, Transfer/genetics , RNA, Transfer/metabolism , Anticodon , Protein Biosynthesis/genetics
2.
Nat Commun ; 11(1): 3551, 2020 07 15.
Article in English | MEDLINE | ID: mdl-32669542

ABSTRACT

Predicting effects of gene regulatory elements (GREs) is a longstanding challenge in biology. Machine learning may address this, but requires large datasets linking GREs to their quantitative function. However, experimental methods to generate such datasets are either application-specific or technically complex and error-prone. Here, we introduce DNA-based phenotypic recording as a widely applicable, practicable approach to generate large-scale sequence-function datasets. We use a site-specific recombinase to directly record a GRE's effect in DNA, enabling readout of both sequence and quantitative function for extremely large GRE-sets via next-generation sequencing. We record translation kinetics of over 300,000 bacterial ribosome binding sites (RBSs) in >2.7 million sequence-function pairs in a single experiment. Further, we introduce a deep learning approach employing ensembling and uncertainty modelling that predicts RBS function with high accuracy, outperforming state-of-the-art methods. DNA-based phenotypic recording combined with deep learning represents a major advance in our ability to predict function from genetic sequence.


Subject(s)
Computational Biology/methods , Deep Learning , Molecular Sequence Annotation/methods , Phenotype , Sequence Analysis, DNA/methods , Binding Sites/genetics , Datasets as Topic , Escherichia coli/genetics , Gene Knockout Techniques , Genome, Bacterial/genetics , High-Throughput Nucleotide Sequencing , Regulatory Sequences, Nucleic Acid/genetics , Ribosomes/metabolism
4.
Nat Commun ; 9(1): 1946, 2018 05 16.
Article in English | MEDLINE | ID: mdl-29769528

ABSTRACT

The problem of the genetics of related phenotypes is often addressed by analyzing adjusted-model traits, but such traits warrant cautious interpretation. Here, we adopt a joint view of adiposity traits in ~322,154 subjects (GIANT consortium). We classify 159 signals associated with body mass index (BMI), waist-to-hip ratio (WHR), or WHR adjusted for BMI (WHRadjBMI) at P < 5 × 10-8, into four classes based on the direction of their effects on BMI and WHR. Our classes help differentiate adiposity genetics with respect to anthropometry, fat depots, and metabolic health. Class-specific Mendelian randomization reveals that variants associated with both WHR-decrease and BMI increase are linked to metabolically rather favorable adiposity through beneficial hip fat. Class-specific enrichment analyses implicate digestive systems as a pathway in adiposity genetics. Our results demonstrate that WHRadjBMI variants capture relevant effects of "unexpected fat distribution given the BMI" and that a joint view of the genetics underlying related phenotypes can inform on important biology.


Subject(s)
Adiposity/genetics , Body Mass Index , Genetic Variation , Waist-Hip Ratio , Adipose Tissue/metabolism , Energy Metabolism/genetics , Genome-Wide Association Study/methods , Humans , Meta-Analysis as Topic , Obesity/classification , Obesity/genetics , Obesity/metabolism , Polymorphism, Single Nucleotide
6.
Sci Rep ; 7: 45040, 2017 04 28.
Article in English | MEDLINE | ID: mdl-28452372

ABSTRACT

HapMap imputed genome-wide association studies (GWAS) have revealed >50 loci at which common variants with minor allele frequency >5% are associated with kidney function. GWAS using more complete reference sets for imputation, such as those from The 1000 Genomes project, promise to identify novel loci that have been missed by previous efforts. To investigate the value of such a more complete variant catalog, we conducted a GWAS meta-analysis of kidney function based on the estimated glomerular filtration rate (eGFR) in 110,517 European ancestry participants using 1000 Genomes imputed data. We identified 10 novel loci with p-value < 5 × 10-8 previously missed by HapMap-based GWAS. Six of these loci (HOXD8, ARL15, PIK3R1, EYA4, ASTN2, and EPB41L3) are tagged by common SNPs unique to the 1000 Genomes reference panel. Using pathway analysis, we identified 39 significant (FDR < 0.05) genes and 127 significantly (FDR < 0.05) enriched gene sets, which were missed by our previous analyses. Among those, the 10 identified novel genes are part of pathways of kidney development, carbohydrate metabolism, cardiac septum development and glucose metabolism. These results highlight the utility of re-imputing from denser reference panels, until whole-genome sequencing becomes feasible in large samples.


Subject(s)
Computational Biology/methods , Genetic Loci , Kidney/physiology , Gene Frequency , Genome, Human , Genome-Wide Association Study , Genotyping Techniques , Humans , Polymorphism, Single Nucleotide
SELECTION OF CITATIONS
SEARCH DETAIL