RESUMO
The genomes of malaria parasites contain many genes of unknown function. To assist drug development through the identification of essential genes and pathways, we have measured competitive growth rates in mice of 2,578 barcoded Plasmodium berghei knockout mutants, representing >50% of the genome, and created a phenotype database. At a single stage of its complex life cycle, P. berghei requires two-thirds of genes for optimal growth, the highest proportion reported from any organism and a probable consequence of functional optimization necessitated by genomic reductions during the evolution of parasitism. In contrast, extreme functional redundancy has evolved among expanded gene families operating at the parasite-host interface. The level of genetic redundancy in a single-celled organism may thus reflect the degree of environmental variation it experiences. In the case of Plasmodium parasites, this helps rationalize both the relative successes of drugs and the greater difficulty of making an effective vaccine.
Assuntos
Genoma de Protozoário , Plasmodium berghei/crescimento & desenvolvimento , Plasmodium berghei/genética , Animais , Evolução Biológica , Feminino , Técnicas de Inativação de Genes , Genes Essenciais , Interações Hospedeiro-Parasita , Redes e Vias Metabólicas , Camundongos , Camundongos Endogâmicos BALB C , Plasmodium berghei/metabolismo , Saccharomyces cerevisiae/genética , Toxoplasma/genética , Trypanosoma brucei brucei/genéticaRESUMO
Assigning functions to genes and learning how to control their expression are part of the foundation of cell biology and therapeutic development. An efficient and unbiased method to accomplish this is genetic screening, which historically required laborious clone generation and phenotyping and is still limited by scale today. The rapid technological progress on modulating gene function with CRISPR-Cas and measuring it in individual cells has now relaxed the major experimental constraints and enabled pooled screening with complex readouts from single cells. Here, we review the principles and practical considerations for pooled single-cell CRISPR screening. We discuss perturbation strategies, experimental model systems, matching the perturbation to the individual cells, reading out cell phenotypes, and data analysis. Our focus is on single-cell RNA sequencing and cell sorting-based readouts, including image-enabled cell sorting. We expect this transformative approach to fuel biomedical research for the next several decades.
Assuntos
Sistemas CRISPR-Cas , Genoma , Sistemas CRISPR-Cas/genética , Genoma/genética , Testes Genéticos/métodos , FenótipoRESUMO
Gene misexpression is the aberrant transcription of a gene in a context where it is usually inactive. Despite its known pathological consequences in specific rare diseases, we have a limited understanding of its wider prevalence and mechanisms in humans. To address this, we analyzed gene misexpression in 4,568 whole-blood bulk RNA sequencing samples from INTERVAL study blood donors. We found that while individual misexpression events occur rarely, in aggregate they were found in almost all samples and a third of inactive protein-coding genes. Using 2,821 paired whole-genome and RNA sequencing samples, we identified that misexpression events are enriched in cis for rare structural variants. We established putative mechanisms through which a subset of SVs lead to gene misexpression, including transcriptional readthrough, transcript fusions, and gene inversion. Overall, we develop misexpression as a type of transcriptomic outlier analysis and extend our understanding of the variety of mechanisms by which genetic variants can influence gene expression.
Assuntos
Regulação da Expressão Gênica , Humanos , Análise de Sequência de RNA , Variação Genética , Variação Estrutural do Genoma/genética , Transcriptoma/genética , Doadores de SangueRESUMO
CRISPR/Cas base editors promise nucleotide-level control over DNA sequences, but the determinants of their activity remain incompletely understood. We measured base editing frequencies in two human cell lines for two cytosine and two adenine base editors at â¼14 000 target sequences and find that base editing activity is sequence-biased, with largest effects from nucleotides flanking the target base. Whether a base is edited depends strongly on the combination of its position in the target and the preceding base, acting to widen or narrow the effective editing window. The impact of features on editing rate depends on the position, with sequence bias efficacy mainly influencing bases away from the center of the window. We use these observations to train a machine learning model to predict editing activity per position, with accuracy ranging from 0.49 to 0.72 between editors, and with better generalization across datasets than existing tools. We demonstrate the usefulness of our model by predicting the efficacy of disease mutation correcting guides, and find that most of them suffer from more unwanted editing than pure outcomes. This work unravels the position-specificity of base editing biases and allows more efficient planning of editing campaigns in experimental and therapeutic contexts.
Assuntos
Sistemas CRISPR-Cas , Edição de Genes , Adenina , Citosina/metabolismo , Humanos , NucleotídeosRESUMO
Genome-wide CRISPR/Cas9 knockout screens are revolutionizing mammalian functional genomics. However, their range of applications remains limited by signal variability from different guide RNAs that target the same gene, which confounds gene effect estimation and dictates large experiment sizes. To address this problem, we report JACKS, a Bayesian method that jointly analyzes screens performed with the same guide RNA library. Modeling the variable guide efficacies greatly improves hit identification over processing a single screen at a time and outperforms existing methods. This more efficient analysis gives additional hits and allows designing libraries with a 2.5-fold reduction in required cell numbers without sacrificing performance compared to current analysis standards.
Assuntos
Sistemas CRISPR-Cas , Técnicas de Inativação de Genes/métodos , Software , Animais , Teorema de BayesRESUMO
The consequence of a mutation can be influenced by the context in which it operates. For example, loss of gene function may be tolerated in one genetic background, and lethal in another. The extent to which mutant phenotypes are malleable, the architecture of modifiers and the identities of causal genes remain largely unknown. Here, we measure the fitness effects of ~ 1,100 temperature-sensitive alleles of yeast essential genes in the context of variation from ten different natural genetic backgrounds and map the modifiers for 19 combinations. Altogether, fitness defects for 149 of the 580 tested genes (26%) could be suppressed by genetic variation in at least one yeast strain. Suppression was generally driven by gain-of-function of a single, strong modifier gene, and involved both genes encoding complex or pathway partners suppressing specific temperature-sensitive alleles, as well as general modifiers altering the effect of many alleles. The emerging frequency of suppression and range of possible mechanisms suggest that a substantial fraction of monogenic diseases could be managed by modulating other gene products.
Assuntos
Mutação com Ganho de Função , Genes Essenciais , Saccharomyces cerevisiae/crescimento & desenvolvimento , Regulação Fúngica da Expressão Gênica , Genes Modificadores , Variação Genética , Mutação , Fenótipo , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genéticaAssuntos
Pesquisa Biomédica/organização & administração , Pesquisa Biomédica/tendências , Objetivos , Cooperação Internacional , Terapia de Alvo Molecular/tendências , Neoplasias/tratamento farmacológico , Neoplasias/genética , Antineoplásicos Imunológicos/farmacologia , Antineoplásicos Imunológicos/uso terapêutico , Pesquisa Biomédica/economia , Sistemas CRISPR-Cas/genética , Sobrevivência Celular/efeitos dos fármacos , Análise Mutacional de DNA , Conjuntos de Dados como Assunto , Feminino , Regulação Neoplásica da Expressão Gênica/efeitos dos fármacos , Humanos , Disseminação de Informação , Aprendizado de Máquina , Mutação , Neoplasias/metabolismo , Neoplasias/patologia , Projetos Piloto , Medicina de Precisão , Reprodutibilidade dos TestesRESUMO
The Klebsiella pneumoniae species complex includes important opportunistic pathogens which have become public health priorities linked to major hospital outbreaks and the recent emergence of multidrug-resistant hypervirulent strains. Bacterial virulence and the spread of multidrug resistance have previously been linked to toxin-antitoxin (TA) systems. TA systems encode a toxin that disrupts essential cellular processes, and a cognate antitoxin which counteracts this activity. Whilst associated with the maintenance of plasmids, they also act in bacterial immunity and antibiotic tolerance. However, the evolutionary dynamics and distribution of TA systems in clinical pathogens are not well understood. Here, we present a comprehensive survey and description of the diversity of TA systems in 259 clinically relevant genomes of K. pneumoniae. We show that TA systems are highly prevalent with a median of 20 loci per strain. Importantly, these toxins differ substantially in their distribution patterns and in their range of cognate antitoxins. Classification along these properties suggests different roles of TA systems and highlights the association and co-evolution of toxins and antitoxins.
Assuntos
Evolução Molecular , Klebsiella pneumoniae/genética , Sistemas Toxina-Antitoxina/genética , Simulação por Computador , Farmacorresistência Bacteriana/genética , Genoma Bacteriano , Klebsiella pneumoniae/efeitos dos fármacos , Klebsiella pneumoniae/patogenicidade , Fenótipo , Fatores de Virulência/genéticaRESUMO
Identifying nuclei is a standard first step when analysing cells in microscopy images. The traditional approach relies on signal from a DNA stain, or fluorescent transgene expression localised to the nucleus. However, imaging techniques that do not use fluorescence can also carry useful information. Here, we used brightfield and fluorescence images of fixed cells with fluorescently labelled DNA, and confirmed that three convolutional neural network architectures can be adapted to segment nuclei from the brightfield channel, relying on fluorescence signal to extract the ground truth for training. We found that U-Net achieved the best overall performance, Mask R-CNN provided an additional benefit of instance segmentation, and that DeepCell proved too slow for practical application. We trained the U-Net architecture on over 200 dataset variations, established that accurate segmentation is possible using as few as 16 training images, and that models trained on images from similar cell lines can extrapolate well. Acquiring data from multiple focal planes further helps distinguish nuclei in the samples. Overall, our work helps to liberate a fluorescence channel reserved for nuclear staining, thus providing more information from the specimen, and reducing reagents and time required for preparing imaging experiments.
Assuntos
Processamento de Imagem Assistida por Computador , Redes Neurais de Computação , Núcleo CelularRESUMO
Gene arrays and operons that encode functionally linked proteins form the most basic unit of transcriptional regulation in bacteria. Rules that govern the order and orientation of genes in these systems have been defined; however, these were based on a small set of genomes that may not be representative. The growing availability of large genomic datasets presents an opportunity to test these rules, to define the full range and diversity of these systems, and to understand their evolution. Here we present SLING, a tool to Search for LINked Genes by searching for a single functionally essential gene, along with its neighbours in a rule-defined proximity (https://github.com/ghoresh11/sling/wiki). Examining this subset of genes enables us to understand the basic diversity of these genetic systems in large datasets. We demonstrate the utility of SLING on a clinical collection of enteropathogenic Escherichia coli for two relevant operons: toxin antitoxin (TA) systems and RND efflux pumps. By examining the diversity of these systems, we gain insight on distinct classes of operons which present variable levels of prevalence and ability to be lost or gained. The importance of this analysis is not limited to TA systems and RND pumps, and can be expanded to understand the diversity of many other relevant gene arrays.
Assuntos
Proteínas de Bactérias/genética , Biologia Computacional/métodos , Genes Bacterianos/genética , Armazenamento e Recuperação da Informação/métodos , Óperon/genética , Antitoxinas/genética , Toxinas Bacterianas/genética , Bases de Dados Genéticas , Genoma Bacteriano/genética , Genômica/métodos , Internet , Reprodutibilidade dos TestesRESUMO
The emergence of microbial antibiotic resistance is a global health threat. In clinical settings, the key to controlling spread of resistant strains is accurate and rapid detection. As traditional culture-based methods are time consuming, genetic approaches have recently been developed for this task. The detection of antibiotic resistance is typically made by measuring a few known determinants previously identified from genome sequencing, and thus requires the prior knowledge of its biological mechanisms. To overcome this limitation, we employed machine learning models to predict resistance to 11 compounds across four classes of antibiotics from existing and novel whole genome sequences of 1936 E. coli strains. We considered a range of methods, and examined population structure, isolation year, gene content, and polymorphism information as predictors. Gradient boosted decision trees consistently outperformed alternative models with an average accuracy of 0.91 on held-out data (range 0.81-0.97). While the best models most frequently employed gene content, an average accuracy score of 0.79 could be obtained using population structure information alone. Single nucleotide variation data were less useful, and significantly improved prediction only for two antibiotics, including ciprofloxacin. These results demonstrate that antibiotic resistance in E. coli can be accurately predicted from whole genome sequences without a priori knowledge of mechanisms, and that both genomic and epidemiological data can be informative. This paves way to integrating machine learning approaches into diagnostic tools in the clinic.
Assuntos
Farmacorresistência Bacteriana/genética , Escherichia coli/genética , Análise de Sequência de DNA/métodos , Antibacterianos/farmacologia , DNA Bacteriano/genética , Farmacorresistência Bacteriana Múltipla/efeitos dos fármacos , Infecções por Escherichia coli , Previsões/métodos , Genoma/genética , Genoma Bacteriano , Humanos , Testes de Sensibilidade MicrobianaRESUMO
BACKGROUND: Low-temperature growth and fermentation of wine yeast can enhance wine aroma and make them highly desirable traits for the industry. Elucidating response to cold in Saccharomyces cerevisiae is, therefore, of paramount importance to select or genetically improve new wine strains. As most enological traits of industrial importance in yeasts, adaptation to low temperature is a polygenic trait regulated by many interacting loci. RESULTS: In order to unravel the genetic determinants of low-temperature fermentation, we mapped quantitative trait loci (QTLs) by bulk segregant analyses in the F13 offspring of two Saccharomyces cerevisiae industrial strains with divergent performance at low temperature. We detected four genomic regions involved in the adaptation at low temperature, three of them located in the subtelomeric regions (chromosomes XIII, XV and XVI) and one in the chromosome XIV. The QTL analysis revealed that subtelomeric regions play a key role in defining individual variation, which emphasizes the importance of these regions' adaptive nature. CONCLUSIONS: The reciprocal hemizygosity analysis (RHA), run to validate the genes involved in low-temperature fermentation, showed that genetic variation in mitochondrial proteins, maintenance of correct asymmetry and distribution of phospholipid in the plasma membrane are key determinants of low-temperature adaptation.
Assuntos
Adaptação Fisiológica/genética , Temperatura Baixa , Regulação Fúngica da Expressão Gênica , Saccharomyces cerevisiae/genética , Estresse Fisiológico/genética , Alelos , Mapeamento Cromossômico , Evolução Molecular , Fermentação/genética , Frequência do Gene , Estudos de Associação Genética , Genoma Fúngico , Genômica/métodos , Genótipo , Fenótipo , Filogenia , Locos de Características Quantitativas , Característica Quantitativa Herdável , Saccharomyces cerevisiae/classificação , Saccharomyces cerevisiae/metabolismoRESUMO
The genetic basis of heritable traits has been studied for decades. Although recent mapping efforts have elucidated genetic determinants of transcript levels, mapping of protein abundance has lagged. Here, we analyze levels of 4084 GFP-tagged yeast proteins in the progeny of a cross between a laboratory and a wild strain using flow cytometry and high-content microscopy. The genotype of trans variants contributed little to protein level variation between individual cells but explained >50% of the variance in the population's average protein abundance for half of the GFP fusions tested. To map trans-acting factors responsible, we performed flow sorting and bulk segregant analysis of 25 proteins, finding a median of five protein quantitative trait loci (pQTLs) per GFP fusion. Further, we find that cis-acting variants predominate; the genotype of a gene and its surrounding region had a large effect on protein level six times more frequently than the rest of the genome combined. We present evidence for both shared and independent genetic control of transcript and protein abundance: More than half of the expression QTLs (eQTLs) contribute to changes in protein levels of regulated genes, but several pQTLs do not affect their cognate transcript levels. Allele replacements of genes known to underlie trans eQTL hotspots confirmed the correlation of effects on mRNA and protein levels. This study represents the first genome-scale measurement of genetic contribution to protein levels in single cells and populations, identifies more than a hundred trans pQTLs, and validates the propagation of effects associated with transcript variation to protein abundance.
Assuntos
Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/genética , Mapeamento Cromossômico , Evolução Molecular , Expressão Gênica , Frequência do Gene , Genótipo , Locos de Características Quantitativas , RNA Fúngico/genética , RNA Fúngico/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/genéticaRESUMO
Technological advances in genomics and imaging have led to an explosion of molecular and cellular profiling data from large numbers of samples. This rapid increase in biological data dimension and acquisition rate is challenging conventional analysis strategies. Modern machine learning methods, such as deep learning, promise to leverage very large data sets for finding hidden structure within them, and for making accurate predictions. In this review, we discuss applications of this new breed of analysis approaches in regulatory genomics and cellular imaging. We provide background of what deep learning is, and the settings in which it can be successfully applied to derive biological insights. In addition to presenting specific applications and providing tips for practical use, we also highlight possible pitfalls and limitations to guide computational biologists when and how to make the most use of this new technology.
Assuntos
Biologia Computacional/métodos , Genômica/métodos , Humanos , Aprendizado de Máquina , Modelos GenéticosRESUMO
The question of how genetic variation in a population influences phenotypic variation and evolution is of major importance in modern biology. Yet much is still unknown about the relative functional importance of different forms of genome variation and how they are shaped by evolutionary processes. Here we address these questions by population level sequencing of 42 strains from the budding yeast Saccharomyces cerevisiae and its closest relative S. paradoxus. We find that genome content variation, in the form of presence or absence as well as copy number of genetic material, is higher within S. cerevisiae than within S. paradoxus, despite genetic distances as measured in single-nucleotide polymorphisms being vastly smaller within the former species. This genome content variation, as well as loss-of-function variation in the form of premature stop codons and frameshifting indels, is heavily enriched in the subtelomeres, strongly reinforcing the relevance of these regions to functional evolution. Genes affected by these likely functional forms of variation are enriched for functions mediating interaction with the external environment (sugar transport and metabolism, flocculation, metal transport, and metabolism). Our results and analyses provide a comprehensive view of genomic diversity in budding yeast and expose surprising and pronounced differences between the variation within S. cerevisiae and that within S. paradoxus. We also believe that the sequence data and de novo assemblies will constitute a useful resource for further evolutionary and population genomics studies.
Assuntos
Genes Fúngicos , Saccharomyces cerevisiae/genética , Arsenitos/farmacologia , Variações do Número de Cópias de DNA , Farmacorresistência Fúngica/genética , Evolução Molecular , Ligação Genética , Especiação Genética , Genoma Fúngico , Anotação de Sequência Molecular , Família Multigênica , Filogenia , Polimorfismo de Nucleotídeo Único , Saccharomyces cerevisiae/efeitos dos fármacos , Saccharomyces cerevisiae/crescimento & desenvolvimento , Análise de Sequência de DNA , Compostos de Sódio/farmacologiaRESUMO
Since the completion of the genome sequence of Saccharomyces cerevisiae in 1996 (refs 1, 2), there has been a large increase in complete genome sequences, accompanied by great advances in our understanding of genome evolution. Although little is known about the natural and life histories of yeasts in the wild, there are an increasing number of studies looking at ecological and geographic distributions, population structure and sexual versus asexual reproduction. Less well understood at the whole genome level are the evolutionary processes acting within populations and species that lead to adaptation to different environments, phenotypic differences and reproductive isolation. Here we present one- to fourfold or more coverage of the genome sequences of over seventy isolates of the baker's yeast S. cerevisiae and its closest relative, Saccharomyces paradoxus. We examine variation in gene content, single nucleotide polymorphisms, nucleotide insertions and deletions, copy numbers and transposable elements. We find that phenotypic variation broadly correlates with global genome-wide phylogenetic relationships. S. paradoxus populations are well delineated along geographic boundaries, whereas the variation among worldwide S. cerevisiae isolates shows less differentiation and is comparable to a single S. paradoxus population. Rather than one or two domestication events leading to the extant baker's yeasts, the population structure of S. cerevisiae consists of a few well-defined, geographically isolated lineages and many different mosaics of these lineages, supporting the idea that human influence provided the opportunity for cross-breeding and production of new combinations of pre-existing variations.
Assuntos
Genoma Fúngico/genética , Genômica , Saccharomyces cerevisiae/genética , Saccharomyces/genética , Genética Populacional , Geografia , Mutação INDEL/genética , Fenótipo , Filogenia , Polimorfismo de Nucleotídeo Único/genética , Saccharomyces/classificação , Seleção GenéticaRESUMO
Screening genome-wide sets of mutants for fitness defects provides a simple but powerful approach for exploring gene function, mapping genetic networks and probing mechanisms of drug action. For yeast and other microorganisms with global mutant collections, genetic or chemical-genetic interactions can be effectively quantified by growing an ordered array of strains on agar plates as individual colonies, and then scoring the colony size changes in response to a genetic or environmental perturbation. To do so, requires efficient tools for the extraction and analysis of quantitative data. Here, we describe SGAtools (http://sgatools.ccbr.utoronto.ca), a web-based analysis system for designer genetic screens. SGAtools outlines a series of guided steps that allow the user to quantify colony sizes from images of agar plates, correct for systematic biases in the observations and calculate a fitness score relative to a control experiment. The data can also be visualized online to explore the colony sizes on individual plates, view the distribution of resulting scores, highlight genes with the strongest signal and perform Gene Ontology enrichment analysis.
Assuntos
Deleção de Genes , Análise em Microsséries , Software , Gráficos por Computador , Aptidão Genética , Processamento de Imagem Assistida por Computador , Internet , Leveduras/genética , Leveduras/crescimento & desenvolvimentoRESUMO
A genetic interaction occurs when the phenotype of an organism carrying two mutant genes differs from what should have been observed given their independent influence. Such unexpected outcome indicates a mechanistic connection between the perturbed genes, providing a key source of functional information about the cell. Large-scale screening for genetic interactions involves measuring phenotypes of single and double mutants, which for microorganisms is usually done by automated analysis of images of ordered colonies. Obtaining accurate colony sizes, and using them to identify genetic interactions from such screens remains a challenging and time-consuming task. Here, we outline steps to compute genetic interaction scores in E. coli by measuring colony sizes from plate images, performing normalisation, and quantifying the strength of the effect.
Assuntos
Bactérias/genética , Controle de QualidadeRESUMO
The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants, but also more recently to assist in the interpretation and elucidation of disease signals. To date, many studies have looked in specific tissues and population-based samples, but there has been limited assessment of the degree of inter-population variability in regulatory variation. We analyzed genome-wide gene expression in lymphoblastoid cell lines from a total of 726 individuals from 8 global populations from the HapMap3 project and correlated gene expression levels with HapMap3 SNPs located in cis to the genes. We describe the influence of ancestry on gene expression levels within and between these diverse human populations and uncover a non-negligible impact on global patterns of gene expression. We further dissect the specific functional pathways differentiated between populations. We also identify 5,691 expression quantitative trait loci (eQTLs) after controlling for both non-genetic factors and population admixture and observe that half of the cis-eQTLs are replicated in one or more of the populations. We highlight patterns of eQTL-sharing between populations, which are partially determined by population genetic relatedness, and discover significant sharing of eQTL effects between Asians, European-admixed, and African subpopulations. Specifically, we observe that both the effect size and the direction of effect for eQTLs are highly conserved across populations. We observe an increasing proximity of eQTLs toward the transcription start site as sharing of eQTLs among populations increases, highlighting that variants close to TSS have stronger effects and therefore are more likely to be detected across a wider panel of populations. Together these results offer a unique picture and resource of the degree of differentiation among human populations in functional regulatory variation and provide an estimate for the transferability of complex trait variants across populations.
Assuntos
Regulação da Expressão Gênica , Locos de Características Quantitativas/genética , Sequências Reguladoras de Ácido Nucleico/genética , Sítio de Iniciação de Transcrição , Povo Asiático/genética , População Negra/genética , Linhagem Celular , Genética Populacional , Genoma Humano , Projeto HapMap , Humanos , Polimorfismo de Nucleotídeo Único , População Branca/genéticaRESUMO
Small RNAs are functional molecules that modulate mRNA transcripts and have been implicated in the aetiology of several common diseases. However, little is known about the extent of their variability within the human population. Here, we characterise the extent, causes, and effects of naturally occurring variation in expression and sequence of small RNAs from adipose tissue in relation to genotype, gene expression, and metabolic traits in the MuTHER reference cohort. We profiled the expression of 15 to 30 base pair RNA molecules in subcutaneous adipose tissue from 131 individuals using high-throughput sequencing, and quantified levels of 591 microRNAs and small nucleolar RNAs. We identified three genetic variants and three RNA editing events. Highly expressed small RNAs are more conserved within mammals than average, as are those with highly variable expression. We identified 14 genetic loci significantly associated with nearby small RNA expression levels, seven of which also regulate an mRNA transcript level in the same region. In addition, these loci are enriched for variants significant in genome-wide association studies for body mass index. Contrary to expectation, we found no evidence for negative correlation between expression level of a microRNA and its target mRNAs. Trunk fat mass, body mass index, and fasting insulin were associated with more than twenty small RNA expression levels each, while fasting glucose had no significant associations. This study highlights the similar genetic complexity and shared genetic control of small RNA and mRNA transcripts, and gives a quantitative picture of small RNA expression variation in the human population.