RESUMO
Latin America continues to be severely underrepresented in genomics research, and fine-scale genetic histories and complex trait architectures remain hidden owing to insufficient data1. To fill this gap, the Mexican Biobank project genotyped 6,057 individuals from 898 rural and urban localities across all 32 states in Mexico at a resolution of 1.8 million genome-wide markers with linked complex trait and disease information creating a valuable nationwide genotype-phenotype database. Here, using ancestry deconvolution and inference of identity-by-descent segments, we inferred ancestral population sizes across Mesoamerican regions over time, unravelling Indigenous, colonial and postcolonial demographic dynamics2-6. We observed variation in runs of homozygosity among genomic regions with different ancestries reflecting distinct demographic histories and, in turn, different distributions of rare deleterious variants. We conducted genome-wide association studies (GWAS) for 22 complex traits and found that several traits are better predicted using the Mexican Biobank GWAS compared to the UK Biobank GWAS7,8. We identified genetic and environmental factors associating with trait variation, such as the length of the genome in runs of homozygosity as a predictor for body mass index, triglycerides, glucose and height. This study provides insights into the genetic histories of individuals in Mexico and dissects their complex trait architectures, both crucial for making precision and preventive medicine initiatives accessible worldwide.
Assuntos
Bancos de Espécimes Biológicos , Genética Médica , Genoma Humano , Genômica , Hispânico ou Latino , Humanos , Glicemia/genética , Glicemia/metabolismo , Estatura/genética , Índice de Massa Corporal , Interação Gene-Ambiente , Marcadores Genéticos/genética , Estudo de Associação Genômica Ampla , Hispânico ou Latino/classificação , Hispânico ou Latino/genética , Homozigoto , México , Fenótipo , Triglicerídeos/sangue , Triglicerídeos/genética , Reino Unido , Genoma Humano/genéticaRESUMO
A widely held-but rarely tested-hypothesis for the origin of animals is that they evolved from a unicellular ancestor, with an apical cilium surrounded by a microvillar collar, that structurally resembled modern sponge choanocytes and choanoflagellates1-4. Here we test this view of animal origins by comparing the transcriptomes, fates and behaviours of the three primary sponge cell types-choanocytes, pluripotent mesenchymal archaeocytes and epithelial pinacocytes-with choanoflagellates and other unicellular holozoans. Unexpectedly, we find that the transcriptome of sponge choanocytes is the least similar to the transcriptomes of choanoflagellates and is significantly enriched in genes unique to either animals or sponges alone. By contrast, pluripotent archaeocytes upregulate genes that control cell proliferation and gene expression, as in other metazoan stem cells and in the proliferating stages of two unicellular holozoans, including a colonial choanoflagellate. Choanocytes in the sponge Amphimedon queenslandica exist in a transient metastable state and readily transdifferentiate into archaeocytes, which can differentiate into a range of other cell types. These sponge cell-type conversions are similar to the temporal cell-state changes that occur in unicellular holozoans5. Together, these analyses argue against homology of sponge choanocytes and choanoflagellates, and the view that the first multicellular animals were simple balls of cells with limited capacity to differentiate. Instead, our results are consistent with the first animal cell being able to transition between multiple states in a manner similar to modern transdifferentiating and stem cells.
Assuntos
Transdiferenciação Celular , Modelos Biológicos , Filogenia , Células-Tronco Pluripotentes/citologia , Poríferos/citologia , Animais , Proliferação de Células , Células Epiteliais/citologia , Células Epiteliais/metabolismo , Evolução Molecular , Células-Tronco Pluripotentes/metabolismo , Poríferos/metabolismo , Reprodutibilidade dos Testes , TranscriptomaRESUMO
An Amendment to this paper has been published and can be accessed via a link at the top of the paper.
RESUMO
BACKGROUND: Plant long non-coding RNAs (lncRNAs) have important regulatory roles in responses to various biotic and abiotic stresses, including light quality. However, no lncRNAs have been specifically linked to the Shade Avoidance Response (SAS). RESULTS: To better understand the involvement of lncRNAs in shade avoidance, we examined RNA-seq libraries for lncRNAs with the potential to function in the neighbor proximity phenomenon in Arabidopsis thaliana (A. thaliana). Using transcriptomes generated from seedlings exposed to high and low red/far-red (R/FR) light conditions, we identified 13 lncRNA genes differentially expressed in cotyledons and 138 in hypocotyls. To infer possible functions for these lncRNAs, we used a 'guilt-by-association' approach to identify genes co-expressed with lncRNAs in a weighted gene co-expression network. Of 34 co-expression modules, 10 showed biological functions related to differential growth. We identified three potential lncRNAs co-regulated with genes related to SAS. T-DNA insertions in two of these lncRNAs were correlated with morphological differences in seedling responses to increased FR light, supporting our strategy for computational identification of lncRNAs involved in SAS. CONCLUSIONS: Using a computational approach, we identified multiple lncRNAs in Arabidopsis involved in SAS. T-DNA insertions caused altered phenotypes under low R/FR light, suggesting functional roles in shade avoidance. Further experiments are needed to determine the specific mechanisms of these lncRNAs in SAS.
Assuntos
Arabidopsis , Regulação da Expressão Gênica de Plantas , Luz , RNA Longo não Codificante , Arabidopsis/genética , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , Redes Reguladoras de Genes , Perfilação da Expressão Gênica , Hipocótilo/genética , Hipocótilo/crescimento & desenvolvimento , Plântula/genética , Plântula/crescimento & desenvolvimento , Plântula/efeitos da radiação , Transcriptoma , Cotilédone/genéticaRESUMO
BACKGROUND: Long non-coding RNAs (lncRNAs) are defined as transcribed molecules longer than 200 nucleotides with little to no protein-coding potential. LncRNAs can regulate gene expression of nearby genes (cis-acting) or genes located on other chromosomes (trans-acting). Several methodologies have been developed to capture lncRNAs associated with chromatin at a genome-wide level. Analysis of RNA-DNA contacts can be combined with epigenetic and RNA-seq data to define potential lncRNAs involved in the regulation of gene expression. RESULTS: We performed Chromatin Associated RNA sequencing (ChAR-seq) in Anolis carolinensis to obtain the genome-wide map of the associations that RNA molecules have with chromatin. We analyzed the frequency of DNA contacts for different classes of RNAs and were able to define cis- and trans-acting lncRNAs. We integrated the ChAR-seq map of RNA-DNA contacts with epigenetic data for the acetylation of lysine 16 on histone H4 (H4K16ac), a mark connected to actively transcribed chromatin in lizards. We successfully identified three trans-acting lncRNAs significantly associated with the H4K16ac signal, which are likely involved in the regulation of gene expression in A. carolinensis. CONCLUSIONS: We show that the ChAR-seq method is a powerful tool to explore the RNA-DNA map of interactions. Moreover, in combination with epigenetic data, ChAR-seq can be applied in non-model species to establish potential roles for predicted lncRNAs that lack functional annotations.
Assuntos
Lagartos , RNA Longo não Codificante , Animais , Cromatina/genética , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , Lagartos/genética , Lagartos/metabolismo , DNA/genética , GenomaRESUMO
[This corrects the article DOI: 10.1371/journal.pcbi.1009218.].
RESUMO
The crown-of-thorns starfish (COTS, the Acanthaster planci species group) is a highly fecund predator of reef-building corals throughout the Indo-Pacific region. COTS population outbreaks cause substantial loss of coral cover, diminishing the integrity and resilience of reef ecosystems. Here we sequenced genomes of COTS from the Great Barrier Reef, Australia and Okinawa, Japan to identify gene products that underlie species-specific communication and could potentially be used in biocontrol strategies. We focused on water-borne chemical plumes released from aggregating COTS, which make the normally sedentary starfish become highly active. Peptide sequences detected in these plumes by mass spectrometry are encoded in the COTS genome and expressed in external tissues. The exoproteome released by aggregating COTS consists largely of signalling factors and hydrolytic enzymes, and includes an expanded and rapidly evolving set of starfish-specific ependymin-related proteins. These secreted proteins may be detected by members of a large family of olfactory-receptor-like G-protein-coupled receptors that are expressed externally, sometimes in a sex-specific manner. This study provides insights into COTS-specific communication that may guide the generation of peptide mimetics for use on reefs with COTS outbreaks.
Assuntos
Recifes de Corais , Genoma/genética , Controle Biológico de Vetores , Estrelas-do-Mar/genética , Animais , Antozoários/parasitologia , Austrália , Biomimética , Feminino , Oceano Índico , Japão , Masculino , Espectrometria de Massas , Proteínas do Tecido Nervoso/química , Proteínas do Tecido Nervoso/metabolismo , Oceano Pacífico , Proteoma/análise , Proteoma/metabolismo , Fatores Sexuais , Especificidade da Espécie , Estrelas-do-Mar/anatomia & histologia , Estrelas-do-Mar/química , Estrelas-do-Mar/enzimologia , TranscriptomaRESUMO
While colocalization within a bacterial operon enables coexpression of the constituent genes, the mechanistic logic of clustering of nonhomologous monocistronic genes in eukaryotes is not immediately obvious. Biosynthetic gene clusters that encode pathways for specialized metabolites are an exception to the classical eukaryote rule of random gene location and provide paradigmatic exemplars with which to understand eukaryotic cluster dynamics and regulation. Here, using 3C, Hi-C, and Capture Hi-C (CHi-C) organ-specific chromosome conformation capture techniques along with high-resolution microscopy, we investigate how chromosome topology relates to transcriptional activity of clustered biosynthetic pathway genes in Arabidopsis thaliana Our analyses reveal that biosynthetic gene clusters are embedded in local hot spots of 3D contacts that segregate cluster regions from the surrounding chromosome environment. The spatial conformation of these cluster-associated domains differs between transcriptionally active and silenced clusters. We further show that silenced clusters associate with heterochromatic chromosomal domains toward the periphery of the nucleus, while transcriptionally active clusters relocate away from the nuclear periphery. Examination of chromosome structure at unrelated clusters in maize, rice, and tomato indicates that integration of clustered pathway genes into distinct topological domains is a common feature in plant genomes. Our results shed light on the potential mechanisms that constrain coexpression within clusters of nonhomologous eukaryotic genes and suggest that gene clustering in the one-dimensional chromosome is accompanied by compartmentalization of the 3D chromosome.
Assuntos
Arabidopsis/genética , Cromossomos de Plantas/genética , Família Multigênica , Proteínas de Plantas/genética , Solanum lycopersicum/genética , Zea mays/genética , Arabidopsis/metabolismo , Cromossomos de Plantas/metabolismo , Genoma de Planta , Solanum lycopersicum/metabolismo , Oryza/genética , Oryza/metabolismo , Proteínas de Plantas/metabolismo , Zea mays/metabolismoRESUMO
Long noncoding RNAs (lncRNAs) have recently emerged as prominent regulators of gene expression in eukaryotes. LncRNAs often drive the modification and maintenance of gene activation or gene silencing states via chromatin conformation rearrangements. In plants, lncRNAs have been shown to participate in gene regulation, and are essential to processes such as vernalization and photomorphogenesis. Despite their prominent functions, only over a dozen lncRNAs have been experimentally and functionally characterized. Similar to its animal counterparts, the rates of sequence divergence are much higher in plant lncRNAs than in protein coding mRNAs, making it difficult to identify lncRNA conservation using traditional sequence comparison methods. Beyond this, little is known about the evolutionary patterns of lncRNAs in plants. Here, we characterized the splicing conservation of lncRNAs in Brassicaceae. We generated a whole-genome alignment of 16 Brassica species and used it to identify synthenic lncRNA orthologs. Using a scoring system trained on transcriptomes from A. thaliana and B. oleracea, we identified splice sites across the whole alignment and measured their conservation. Our analysis revealed that 17.9% (112/627) of all intergenic lncRNAs display splicing conservation in at least one exon, an estimate that is substantially higher than previous estimates of lncRNA conservation in this group. Our findings agree with similar studies in vertebrates, demonstrating that splicing conservation can be evidence of stabilizing selection. We provide conclusive evidence for the existence of evolutionary deeply conserved lncRNAs in plants and describe a generally applicable computational workflow to identify functional lncRNAs in plants.
Assuntos
Sequência Conservada/genética , Splicing de RNA/genética , RNA Longo não Codificante/genética , RNA de Plantas/genética , Arabidopsis/genética , Brassica/genética , Evolução Molecular , Genoma de Planta/genética , RNA Mensageiro/genéticaRESUMO
Animals are grouped into ~35 'phyla' based upon the notion of distinct body plans. Morphological and molecular analyses have revealed that a stage in the middle of development--known as the phylotypic period--is conserved among species within some phyla. Although these analyses provide evidence for their existence, phyla have also been criticized as lacking an objective definition, and consequently based on arbitrary groupings of animals. Here we compare the developmental transcriptomes of ten species, each annotated to a different phylum, with a wide range of life histories and embryonic forms. We find that in all ten species, development comprises the coupling of early and late phases of conserved gene expression. These phases are linked by a divergent 'mid-developmental transition' that uses species-specific suites of signalling pathways and transcription factors. This mid-developmental transition overlaps with the phylotypic period that has been defined previously for three of the ten phyla, suggesting that transcriptional circuits and signalling mechanisms active during this transition are crucial for defining the phyletic body plan and that the mid-developmental transition may be used to define phylotypic periods in other phyla. Placing these observations alongside the reported conservation of mid-development within phyla, we propose that a phylum may be defined as a collection of species whose gene expression at the mid-developmental transition is both highly conserved among them, yet divergent relative to other species.
Assuntos
Padronização Corporal , Desenvolvimento Embrionário , Filogenia , Animais , Padronização Corporal/genética , Sequência Conservada/genética , Desenvolvimento Embrionário/genética , Evolução Molecular , Regulação da Expressão Gênica no Desenvolvimento , Redes Reguladoras de Genes , Genes Controladores do Desenvolvimento/genética , Modelos Biológicos , Fenótipo , Especificidade da Espécie , Transcriptoma/genéticaRESUMO
BACKGROUND: Micro RNAs (miRNAs) and piwi interacting RNAs (piRNAs), along with the more ancient eukaryotic endogenous small interfering RNAs (endo-siRNAs) constitute the principal components of the RNA interference (RNAi) repertoire of most animals. RNAi in non-bilaterians - sponges, ctenophores, placozoans and cnidarians - appears to be more diverse than that of bilaterians, and includes structurally variable miRNAs in sponges, an enormous number of piRNAs in cnidarians and the absence of miRNAs in ctenophores and placozoans. RESULTS: Here we identify thousands of endo-siRNAs and piRNAs from the sponge Amphimedon queenslandica, the ctenophore Mnemiopsis leidyi and the cnidarian Nematostella vectensis using a computational approach that clusters mapped small RNA sequences and annotates each cluster based on the read length and relative abundance of the constituent reads. This approach was validated on 11 small RNA libraries in Drosophila melanogaster, demonstrating the successful annotation of RNAi-associated loci with properties consistent with previous reports. In the non-bilaterians we uncover seven new miRNAs from Amphimedon and four from Nematostella as well as sub-populations of candidate cis-natural antisense transcript (cis-NAT) endo-siRNAs. We confirmed the absence of miRNAs in Mnemiopsis but detected an abundance of endo-siRNAs in this ctenophore. Analysis of putative piRNA structure suggests that conserved localised secondary structures in primary transcripts may be important for the production of mature piRNAs in Amphimedon and Nematostella, as is also the case for endo-siRNAs. CONCLUSION: Together, these findings suggest that the last common ancestor of extant animals did not have the entrained RNAi system that typifies bilaterians. Instead it appears that bilaterians, cnidarians, ctenophores and sponges express unique repertoires and combinations of miRNAs, piRNAs and endo-siRNAs.
Assuntos
Evolução Biológica , Interferência de RNA , Animais , Ctenóforos/genética , Drosophila/genética , Biblioteca Gênica , Genoma , MicroRNAs/genética , Anotação de Sequência Molecular , RNA Interferente Pequeno/metabolismo , Anêmonas-do-Mar/genéticaRESUMO
RNA-Seq enables the efficient transcriptome sequencing of many samples from small amounts of material, but the analysis of these data remains challenging. In particular, in developmental studies, RNA-Seq is challenged by the morphological staging of samples, such as embryos, since these often lack clear markers at any particular stage. In such cases, the automatic identification of the stage of a sample would enable previously infeasible experimental designs. Here we present the 'basic linear index determination of transcriptomes' (BLIND) method for ordering samples comprising different developmental stages. The method is an implementation of a traveling salesman algorithm to order the transcriptomes according to their inter-relationships as defined by principal components analysis. To establish the direction of the ordered samples, we show that an appropriate indicator is the entropy of transcriptomic gene expression levels, which increases over developmental time. Using BLIND, we correctly recover the annotated order of previously published embryonic transcriptomic timecourses for frog, mosquito, fly and zebrafish. We further demonstrate the efficacy of BLIND by collecting 59 embryos of the sponge Amphimedon queenslandica and ordering their transcriptomes according to developmental stage. BLIND is thus useful in establishing the temporal order of samples within large datasets and is of particular relevance to the study of organisms with asynchronous development and when morphological staging is difficult.
Assuntos
Perfilação da Expressão Gênica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Transcriptoma/genética , Animais , Regulação da Expressão Gênica no Desenvolvimento , Análise de Componente PrincipalRESUMO
Long noncoding RNAs (lncRNAs) are important developmental regulators in bilaterian animals. A correlation has been claimed between the lncRNA repertoire expansion and morphological complexity in vertebrate evolution. However, this claim has not been tested by examining morphologically simple animals. Here, we undertake a systematic investigation of lncRNAs in the demosponge Amphimedon queenslandica, a morphologically simple, early-branching metazoan. We combine RNA-Seq data across multiple developmental stages of Amphimedon with a filtering pipeline to conservatively predict 2,935 lncRNAs. These include intronic overlapping lncRNAs, exonic antisense overlapping lncRNAs, long intergenic nonprotein coding RNAs, and precursors for small RNAs. Sponge lncRNAs are remarkably similar to their bilaterian counterparts in being relatively short with few exons and having low primary sequence conservation relative to protein-coding genes. As in bilaterians, a majority of sponge lncRNAs exhibit typical hallmarks of regulatory molecules, including high temporal specificity and dynamic developmental expression. Specific lncRNA expression profiles correlate tightly with conserved protein-coding genes likely involved in a range of developmental and physiological processes, such as the Wnt signaling pathway. Although the majority of Amphimedon lncRNAs appears to be taxonomically restricted with no identifiable orthologs, we find a few cases of conservation between demosponges in lncRNAs that are antisense to coding sequences. Based on the high similarity in the structure, organization, and dynamic expression of sponge lncRNAs to their bilaterian counterparts, we propose that these noncoding RNAs are an ancient feature of the metazoan genome. These results are consistent with lncRNAs regulating the development of animals, regardless of their level of morphological complexity.
Assuntos
Poríferos/genética , RNA Longo não Codificante/genética , Animais , Sequência de Bases , Sequência Conservada , Epigênese Genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento , Especiação Genética , Poríferos/metabolismo , RNA Longo não Codificante/metabolismo , TranscriptomaRESUMO
BACKGROUND: The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. RESULTS: Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. CONCLUSIONS: The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.
Assuntos
Perfilação da Expressão Gênica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Anotação de Sequência Molecular/métodos , Poríferos/crescimento & desenvolvimento , Poríferos/genética , Processamento Alternativo , Animais , Sequência Conservada , Etiquetas de Sequências Expressas/metabolismo , Genômica , Íntrons/genética , Análise de Sequência de RNARESUMO
Long noncoding RNAs (lncRNAs) are essential regulatory elements of sex chromosomes that act to equalize gene expression levels between males and females. XIST, RSX, and roX2 regulate X chromosomes in placental mammals, marsupials, and Drosophila, respectively. Because the green anole (Anolis carolinensis) shows complete dosage compensation of its X chromosome, we tested whether a lncRNA was involved. We found an ancient lncRNA, MAYEX, that gained male-specific expression more than 89 million years ago. MAYEX evolved a notable association with the acetylated histone 4 lysine 16 (H4K16ac) epigenetic mark and the ability to loop its locus to the totality of the X chromosome to increase expression levels. MAYEX is the first lncRNA in reptiles linked to a dosage compensation mechanism that balances the expression of sex chromosomes.
Assuntos
Mecanismo Genético de Compensação de Dose , Lagartos , RNA Longo não Codificante , Cromossomo X , Animais , Feminino , Masculino , Acetilação , Epigênese Genética , Evolução Molecular , Histonas/metabolismo , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , Cromossomo X/genética , Lagartos/genéticaRESUMO
MicroRNAs are important regulators of developmental gene expression, but their contribution to fetal gonad development is not well understood. We have identified the evolutionarily conserved gonadal microRNAs miR-202-5p and miR-202-3p as having a potential role in regulating mouse embryonic gonad differentiation. These microRNAs are expressed in a sexually dimorphic pattern as the primordial XY gonad differentiates into a testis, with strong expression in Sertoli cells. In vivo, ectopic expression of pri-miR-202 in XX gonads did not result in molecular changes to the ovarian determination pathway. Expression of the primary transcript of miR-202-5p/3p remained low in XY gonads in a conditional Sox9-null mouse model, suggesting that pri-miR-202 transcription is downstream of SOX9, a transcription factor that is both necessary and sufficient for male sex determination. We identified the pri-miR-202 promoter that is sufficient to drive expression in XY but not XX fetal gonads ex vivo. Mutation of SOX9 and SF1 binding sites reduced ex vivo transactivation of the pri-miR-202 promoter, demonstrating that pri-miR-202 may be a direct transcriptional target of SOX9/SF1 during testis differentiation. Our findings indicate that expression of the conserved gonad microRNA, miR-202-5p/3p, is downstream of the testis-determining factor SOX9, suggesting an early role in testis development.
Assuntos
Regulação da Expressão Gênica no Desenvolvimento , MicroRNAs/metabolismo , Organogênese/genética , Fatores de Transcrição SOX9/metabolismo , Testículo/embriologia , Animais , Diferenciação Celular/genética , Masculino , Camundongos , Camundongos Knockout , MicroRNAs/genética , Regiões Promotoras Genéticas , Fatores de Transcrição SOX9/genética , Células de Sertoli/citologia , Células de Sertoli/metabolismo , Diferenciação Sexual/genética , Testículo/citologia , Testículo/metabolismo , Transcrição GênicaRESUMO
MicroRNAs (miRNAs) have been shown to play key regulatory roles in a range of biological processes, including cell differentiation and development. To identify miRNAs that participate in gonad differentiation, a fundamental and tightly regulated developmental process, we examined miRNA expression profiles at the time of sex determination and during the early fetal differentiation of mouse testes and ovaries using high-throughput sequencing. We identified several miRNAs that were expressed in a sexually dimorphic pattern, including several members of the let-7 family, miR-378, and miR-140-3p. We focused our analysis on the most highly expressed, sexually dimorphic miRNA, miR-140-3p, and found that both miR-140-3p and its more lowly expressed counterpart, the previously annotated guide strand, miR-140-5p, are testis enriched and expressed in testis cords. Analysis of the miR-140-5p/miR-140-3p-null mouse revealed a significant increase in the number of Leydig cells in the developing XY gonad, strongly suggesting an important role for miR-140-5p/miR-140-3p in testis differentiation in mouse.