RESUMO
The origins of neural systems remain unresolved. In contrast to other basal metazoans, ctenophores (comb jellies) have both complex nervous and mesoderm-derived muscular systems. These holoplanktonic predators also have sophisticated ciliated locomotion, behaviour and distinct development. Here we present the draft genome of Pleurobrachia bachei, Pacific sea gooseberry, together with ten other ctenophore transcriptomes, and show that they are remarkably distinct from other animal genomes in their content of neurogenic, immune and developmental genes. Our integrative analyses place Ctenophora as the earliest lineage within Metazoa. This hypothesis is supported by comparative analysis of multiple gene families, including the apparent absence of HOX genes, canonical microRNA machinery, and reduced immune complement in ctenophores. Although two distinct nervous systems are well recognized in ctenophores, many bilaterian neuron-specific genes and genes of 'classical' neurotransmitter pathways either are absent or, if present, are not expressed in neurons. Our metabolomic and physiological data are consistent with the hypothesis that ctenophore neural systems, and possibly muscle specification, evolved independently from those in other animals.
Assuntos
Ctenóforos/genética , Evolução Molecular , Genoma/genética , Sistema Nervoso , Animais , Ctenóforos/classificação , Ctenóforos/imunologia , Ctenóforos/fisiologia , Genes Controladores do Desenvolvimento , Genes Homeobox , Mesoderma/metabolismo , Metabolômica , MicroRNAs , Dados de Sequência Molecular , Músculos/fisiologia , Sistema Nervoso/metabolismo , Neurônios/metabolismo , Neurotransmissores , Filogenia , Transcriptoma/genéticaRESUMO
We present a database of repetitive DNA elements, called Dfam (http://dfam.janelia.org). Many genomes contain a large fraction of repetitive DNA, much of which is made up of remnants of transposable elements (TEs). Accurate annotation of TEs enables research into their biology and can shed light on the evolutionary processes that shape genomes. Identification and masking of TEs can also greatly simplify many downstream genome annotation and sequence analysis tasks. The commonly used TE annotation tools RepeatMasker and Censor depend on sequence homology search tools such as cross_match and BLAST variants, as well as Repbase, a collection of known TE families each represented by a single consensus sequence. Dfam contains entries corresponding to all Repbase TE entries for which instances have been found in the human genome. Each Dfam entry is represented by a profile hidden Markov model, built from alignments generated using RepeatMasker and Repbase. When used in conjunction with the hidden Markov model search tool nhmmer, Dfam produces a 2.9% increase in coverage over consensus sequence search methods on a large human benchmark, while maintaining low false discovery rates, and coverage of the full human genome is 54.5%. The website provides a collection of tools and data views to support improved TE curation and annotation efforts. Dfam is also available for download in flat file format or in the form of MySQL table dumps.
Assuntos
Elementos de DNA Transponíveis , Bases de Dados de Ácidos Nucleicos , Genoma Humano , Humanos , Internet , Cadeias de Markov , Modelos Estatísticos , Anotação de Sequência MolecularRESUMO
BACKGROUND: Crocodilians are thought to be hosts to a diverse and divergent complement of endogenous retroviruses (ERVs) but a comprehensive investigation is yet to be performed. The recent sequencing of three crocodilian genomes provides an opportunity for a more detailed and accurate representation of the ERV diversity that is present in these species. Here we investigate the diversity, distribution and evolution of ERVs from the genomes of three key crocodilian species, and outline the key processes driving crocodilian ERV proliferation and evolution. RESULTS: ERVs and ERV related sequences make up less than 2% of crocodilian genomes. We recovered and described 45 ERV groups within the three crocodilian genomes, many of which are species specific. We have also revealed a new class of ERV, ERV4, which appears to be common to crocodilians and turtles, and currently has no characterised exogenous counterpart. For the first time, we formally describe the characteristics of this ERV class and its classification relative to other recognised ERV and retroviral classes. This class shares some sequence similarity and sequence characteristics with ERV3, although it is phylogenetically distinct from the other ERV classes. We have also identified two instances of gene capture by crocodilian ERVs, one of which, the capture of a host KIT-ligand mRNA has occurred without the loss of an ERV domain. CONCLUSIONS: This study indicates that crocodilian ERVs comprise a wide variety of lineages, many of which appear to reflect ancient infections. In particular, ERV4 appears to have a limited host range, with current data suggesting that it is confined to crocodilians and some lineages of turtles. Also of interest are two ERV groups that demonstrate evidence of host gene capture. This study provides a framework to facilitate further studies into non-mammalian vertebrates and highlights the need for further studies into such species.
Assuntos
Jacarés e Crocodilos/genética , Jacarés e Crocodilos/virologia , Retrovirus Endógenos/classificação , Retrovirus Endógenos/genética , Evolução Molecular , Variação Genética , Genoma , Animais , Análise por Conglomerados , Biologia Computacional , Filogenia , Recombinação Genética , Homologia de Sequência , Tartarugas/virologiaRESUMO
Lancelets ('amphioxus') are the modern survivors of an ancient chordate lineage, with a fossil record dating back to the Cambrian period. Here we describe the structure and gene content of the highly polymorphic approximately 520-megabase genome of the Florida lancelet Branchiostoma floridae, and analyse it in the context of chordate evolution. Whole-genome comparisons illuminate the murky relationships among the three chordate groups (tunicates, lancelets and vertebrates), and allow not only reconstruction of the gene complement of the last common chordate ancestor but also partial reconstruction of its genomic organization, as well as a description of two genome-wide duplications and subsequent reorganizations in the vertebrate lineage. These genome-scale events shaped the vertebrate genome and provided additional genetic variation for exploitation during vertebrate evolution.
Assuntos
Cordados/genética , Evolução Molecular , Genoma/genética , Animais , Cordados/classificação , Sequência Conservada , Elementos de DNA Transponíveis/genética , Duplicação Gênica , Genes/genética , Ligação Genética , Humanos , Íntrons/genética , Cariotipagem , Família Multigênica , Filogenia , Polimorfismo Genético/genética , Proteínas/genética , Sintenia , Fatores de Tempo , Vertebrados/classificação , Vertebrados/genéticaRESUMO
We report a high-quality draft of the genome sequence of the grey, short-tailed opossum (Monodelphis domestica). As the first metatherian ('marsupial') species to be sequenced, the opossum provides a unique perspective on the organization and evolution of mammalian genomes. Distinctive features of the opossum chromosomes provide support for recent theories about genome evolution and function, including a strong influence of biased gene conversion on nucleotide sequence composition, and a relationship between chromosomal characteristics and X chromosome inactivation. Comparison of opossum and eutherian genomes also reveals a sharp difference in evolutionary innovation between protein-coding and non-coding functional elements. True innovation in protein-coding genes seems to be relatively rare, with lineage-specific differences being largely due to diversification and rapid turnover in gene families involved in environmental interactions. In contrast, about 20% of eutherian conserved non-coding elements (CNEs) are recent inventions that postdate the divergence of Eutheria and Metatheria. A substantial proportion of these eutherian-specific CNEs arose from sequence inserted by transposable elements, pointing to transposons as a major creative force in the evolution of mammalian gene regulation.
Assuntos
Evolução Molecular , Genoma/genética , Genômica , Gambás/genética , Animais , Composição de Bases , Sequência Conservada/genética , Elementos de DNA Transponíveis/genética , Humanos , Polimorfismo de Nucleotídeo Único/genética , Biossíntese de Proteínas , Sintenia/genética , Inativação do Cromossomo X/genéticaRESUMO
Autonomous non-long terminal repeat (non-LTR) retrotransposons and their repetitive remnants are ubiquitous components of mammalian genomes. Recently, we identified non-LTR retrotransposon families, Ingi-1_AAl and Ingi-1_EE, in two hedgehog genomes. Here we rename them to Vingi-1_AAl and Vingi-1_EE and report a new clade "Vingi," which is a sister clade of Ingi that lacks the ribonuclease H domain. In the European hedgehog genome, there are 11 non-autonomous families of elements derived from Vingi-1_EE by internal deletions. No retrotransposons related to Vingi elements were found in any of the remaining 33 mammalian genomes nearly completely sequenced to date, but we identified several new families of Vingi and Ingi retrotransposons outside mammals. Our data suggest the horizontal transfer of Vingi elements to hedgehog, although the vertical transfer cannot be ruled out. The compact structure and trans-mobilization of nonautonomous derivatives of Vingi can make them useful for in vivo retrotransposition assay system.
Assuntos
Ouriços/genética , Retroelementos/genética , Sequências Repetidas Terminais , Animais , Sequência de Bases , Evolução Molecular , Genoma , Humanos , Dados de Sequência Molecular , FilogeniaRESUMO
DNA methylation has important functions in stable, transcriptional gene silencing, immobilization of transposable elements and genome organization. In Arabidopsis, DNA methylation can be induced by double-stranded RNA through the RNA interference (RNAi) pathway, a response known as RNA-directed DNA methylation. This requires a specialized set of RNAi components, including ARGONAUTE4 (AGO4). Here we show that AGO4 binds to small RNAs including small interfering RNAs (siRNAs) originating from transposable and repetitive elements, and cleaves target RNA transcripts. Single mutations in the Asp-Asp-His catalytic motif of AGO4 do not affect siRNA-binding activity but abolish its catalytic potential. siRNA accumulation and non-CpG DNA methylation at some loci require the catalytic activity of AGO4, whereas others are less dependent on this activity. Our results are consistent with a model in which AGO4 can function at target loci through two distinct and separable mechanisms. First, AGO4 can recruit components that signal DNA methylation in a manner independent of its catalytic activity. Second, AGO4 catalytic activity can be crucial for the generation of secondary siRNAs that reinforce its repressive effects.
Assuntos
Proteínas de Arabidopsis/metabolismo , Metilação de DNA , DNA de Plantas/metabolismo , RNA de Plantas/metabolismo , Sequência de Aminoácidos , Proteínas de Arabidopsis/química , Proteínas de Arabidopsis/genética , Proteínas Argonautas , Catálise , Ilhas de CpG/genética , DNA de Plantas/genética , Humanos , Dados de Sequência Molecular , Interferência de RNA , RNA de Plantas/genética , RNA Interferente Pequeno/genética , RNA Interferente Pequeno/metabolismo , Alinhamento de SequênciaRESUMO
Gibbon species have accumulated an unusually high number of chromosomal changes since diverging from the common hominoid ancestor 15-18 million years ago. The cause of this increased rate of chromosomal rearrangements is not known, nor is it known if genome architecture has a role. To address this question, we analyzed sequences spanning 57 breaks of synteny between northern white-cheeked gibbons (Nomascus l. leucogenys) and humans. We find that the breakpoint regions are enriched in segmental duplications and repeats, with Alu elements being the most abundant. Alus located near the gibbon breakpoints (<150 bp) have a higher CpG content than other Alus. Bisulphite allelic sequencing reveals that these gibbon Alus have a lower average density of methylated cytosine that their human orthologues. The finding of higher CpG content and lower average CpG methylation suggests that the gibbon Alu elements are epigenetically distinct from their human orthologues. The association between undermethylation and chromosomal rearrangement in gibbons suggests a correlation between epigenetic state and structural genome variation in evolution.
Assuntos
Citosina/metabolismo , Metilação de DNA , Evolução Molecular , Hylobates/genética , Elementos Alu , Animais , Mapeamento Cromossômico , Quebras de DNA , Epigênese Genética , Rearranjo Gênico , Genoma Humano , Humanos , Hylobates/metabolismo , Cariotipagem , Modelos Genéticos , Especificidade da Espécie , SinteniaRESUMO
The aspergilli comprise a diverse group of filamentous fungi spanning over 200 million years of evolution. Here we report the genome sequence of the model organism Aspergillus nidulans, and a comparative study with Aspergillus fumigatus, a serious human pathogen, and Aspergillus oryzae, used in the production of sake, miso and soy sauce. Our analysis of genome structure provided a quantitative evaluation of forces driving long-term eukaryotic genome evolution. It also led to an experimentally validated model of mating-type locus evolution, suggesting the potential for sexual reproduction in A. fumigatus and A. oryzae. Our analysis of sequence conservation revealed over 5,000 non-coding regions actively conserved across all three species. Within these regions, we identified potential functional elements including a previously uncharacterized TPP riboswitch and motifs suggesting regulation in filamentous fungi by Puf family genes. We further obtained comparative and experimental evidence indicating widespread translational regulation by upstream open reading frames. These results enhance our understanding of these widely studied fungi as well as provide new insight into eukaryotic genome evolution and gene regulation.
Assuntos
Aspergillus fumigatus/genética , Aspergillus nidulans/genética , Aspergillus oryzae/genética , Genoma Fúngico/genética , Genômica , Aspergillus fumigatus/fisiologia , Aspergillus nidulans/fisiologia , Aspergillus oryzae/fisiologia , Sequência de Bases , Sequência Consenso/genética , Sequência Conservada/genética , Evolução Molecular , Genes Fúngicos Tipo Acasalamento/genética , Humanos , Dados de Sequência Molecular , Fases de Leitura Aberta/genética , Filogenia , Proteoma/genética , Sequências Reguladoras de Ácido Nucleico/genética , Análise de Sequência de DNA , Sintenia/genéticaRESUMO
Ancient, inactive copies of transposable elements of the PIF/Harbinger superfamily have been described in vertebrates. We reconstructed components of the Harbinger3_DR transposon in zebrafish, including a transposase and a second, transposon-encoded protein that has a Myb-like trihelix domain. The reconstructed Harbinger transposon shows efficient cut-and-paste transposition in human cells and preferentially inserts into a 15-bp consensus target sequence. The Myb-like protein is required for transposition and physically interacts with the N-terminal region of the transposase via its C-terminal domain. The Myb-like protein enables transposition in part by promoting nuclear import of the transposase, by directly binding to subterminal regions of the transposon, and by recruiting the transposase to the transposon ends. We investigated the functions of two transposon-derived human proteins: HARBI1, a domesticated transposase-derived protein, and NAIF1, which contains a trihelix motif similar to that described in the Myb-like protein. Physical interaction, subcellular localization, and DNA-binding activities of HARBI1 and NAIF1 suggest strong functional homologies between the Harbinger3_DR system and their related, host-encoded counterparts. The Harbinger transposon will serve as a useful experimental system for transposon biology and for investigating the enzymatic functions of domesticated, transposon-derived cellular genes.
Assuntos
Proteínas Reguladoras de Apoptose/genética , Elementos de DNA Transponíveis/genética , Proteínas Nucleares/genética , Homologia de Sequência de Aminoácidos , Transposases/genética , Transporte Ativo do Núcleo Celular , Sequência de Aminoácidos , Animais , Proteínas Reguladoras de Apoptose/química , Pareamento de Bases , Sequência de Bases , Núcleo Celular/metabolismo , Sequência Consenso , Células HeLa , Humanos , Dados de Sequência Molecular , Mutagênese Insercional , Proteínas Nucleares/química , Ligação Proteica , Transporte Proteico , Proteínas Proto-Oncogênicas c-myb/metabolismo , Sequências Repetitivas de Ácido Nucleico/genética , Frações Subcelulares , Transposases/química , Peixe-ZebraRESUMO
Despite their enormous diversity and abundance, all currently known eukaryotic DNA transposons belong to only 15 superfamilies. Here, we report two new superfamilies of DNA transposons, named Sola and Zator. Sola transposons encode DDD-transposases (transposase, TPase) and are flanked by 4-bp target site duplications (TSD). Elements from the Sola superfamily are distributed in a variety of species including bacteria, protists, plants, and metazoans. They can be divided into three distinct groups of elements named Sola1, Sola2, and Sola3. The elements from each group have extremely low sequence identity to each other, different termini, and different target site preferences. However, all three groups belong to a single superfamily based on significant PSI-Blast identities between their TPases. The DDD TPase sequences encoded by Sola transposons are not similar to any known TPases. The second superfamily named Zator is characterized by 3-bp TSD. The Zator superfamily is relatively rare in eukaryotic species, and it evolved from a bacterial transposon encoding a TPase belonging to the "transposase 36" family (Pfam07592). These transposons are named TP36 elements (abbreviated from transposase 36).
Assuntos
Elementos de DNA Transponíveis/genética , Células Eucarióticas/metabolismo , Hydra/genética , Sequência de Aminoácidos , Animais , Bactérias/enzimologia , Sequência de Bases , Biocatálise , Genoma/genética , Dados de Sequência Molecular , Mutagênese Insercional , Filogenia , Alinhamento de Sequência , Homologia de Sequência do Ácido Nucleico , Transposases/químicaRESUMO
Rolling-circle eukaryotic transposons, known as Helitron transposons, were first discovered in plants (Arabidopsis thaliana and Oryza sativa) and in the nematode Caenorhabditis elegans. To date, Helitrons have been identified in a diverse range of species, from protists to mammals. They represent a major class of eukaryotic transposons and are fundamentally different from classical transposons in terms of their structure and mechanism of transposition. Helitrons seem to have a major role in the evolution of host genomes. They frequently capture diverse host genes, some of which can evolve into novel host genes or become essential for helitron transposition.
Assuntos
Elementos de DNA Transponíveis , Evolução Molecular , Animais , Células Eucarióticas , Duplicação Gênica , Variação Genética , Genoma , Humanos , Modelos Biológicos , Modelos GenéticosRESUMO
The evolutionary relationships of 7SL RNA-derived SINEs such as the primate Alu or the rodent B1 elements have hitherto been obscure. We established an unambiguous phylogenetic tree for Supraprimates, and derived intraordinal relationships of the 7SL RNA-derived SINEs. As well as new elements in Tupaia and primates, we also found that the purported ancestral fossil Alu monomer was restricted to Primates, and provide here the first description of a potential chimeric promoter box region in SINEs.
Assuntos
Elementos Alu/genética , Evolução Molecular , Primatas/genética , RNA Citoplasmático Pequeno/genética , Elementos Nucleotídeos Curtos e Dispersos/genética , Partícula de Reconhecimento de Sinal/genética , Animais , FilogeniaRESUMO
Human processed pseudogenes are copies of cellular RNAs reverse transcribed and inserted into the nuclear genome by the enzymatic machinery of L1 (LINE1) non-LTR retrotransposons. Although it is generally accepted that germline expression is crucial for the heritable retroposition of cellular mRNAs, little is known about the influences of RNA stability, mRNA quality control and compartmentalization of translation on the retroposition of processed pseudogenes. We found that frequently retroposed human mRNAs are derived from stable transcripts with translation-competent functional reading frames that are resistant to nonsense-mediated RNA decay. They are preferentially translated on free cytoplasmic ribosomes and encode soluble proteins. Our results indicate that interactions between mRNAs and L1 proteins seem to occur at free cytoplasmic ribosomes.
Assuntos
Elementos Nucleotídeos Longos e Dispersos/genética , Biossíntese de Proteínas , Pseudogenes/genética , Estabilidade de RNA , Retroelementos/genética , Animais , Expressão Gênica , Humanos , RNA Mensageiro/genética , RNA Mensageiro/metabolismoRESUMO
BACKGROUND: Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Repbase already has software for entering new sequence families and for comparing the user's sequence with the database of consensus sequences. RESULTS: We describe the software named VisualRepbase and the associated database, which allow for displaying and analyzing all occurrences of transposable element families present in an annotated genome. VisualRepbase is a Java-based interface which can download selected occurrences of transposable elements, show the distribution of given families on the chromosome, and present the localization of these occurrences with regard to gene annotations and other families of transposable elements in Repbase. In addition, it has several features for saving the graphical representation of occurrences, saving all sequences in FASTA format, and searching and saving all annotated genes that are surrounded by these occurrences. CONCLUSION: VisualRepbase is available as a downloadable version. It can be found at http://girinst.org/repbase/update/visual repbase.html.
Assuntos
Elementos de DNA Transponíveis , Bases de Dados Genéticas , Software , Mapeamento Cromossômico , Gráficos por Computador , Sequências Repetitivas de Ácido Nucleico , Interface Usuário-ComputadorRESUMO
Early studies of human Alu retrotransposons focused on their origin, evolution and biological properties, but current focus is shifting toward the effect of Alu elements on evolution of the human genome. Recent analyses indicate that numerous factors have affected the chromosomal distribution of Alu elements over time, including male-driven insertions, deletions and rapid CpG mutations after their retrotransposition. Unequal crossing over between Alu elements can lead to local mutations or to large segmental duplications responsible for genetic diseases and long-term evolutionary changes. Alu elements can also affect human (primate) evolution by introducing alternative splice sites in existing genes. Studying the Alu family in a human genomic context is likely to have general significance for our understanding of the evolutionary impact of other repetitive elements in diverse eukaryotic genomes.
Assuntos
Elementos Alu , Evolução Biológica , Polimorfismo Genético , Sequências Repetitivas de Ácido Nucleico , Variação Genética , Genoma , HumanosRESUMO
The V(D)J recombination reaction in jawed vertebrates is catalyzed by the RAG1 and RAG2 proteins, which are believed to have emerged approximately 500 million years ago from transposon-encoded proteins. Yet no transposase sequence similar to RAG1 or RAG2 has been found. Here we show that the approximately 600-amino acid "core" region of RAG1 required for its catalytic activity is significantly similar to the transposase encoded by DNA transposons that belong to the Transib superfamily. This superfamily was discovered recently based on computational analysis of the fruit fly and African malaria mosquito genomes. Transib transposons also are present in the genomes of sea urchin, yellow fever mosquito, silkworm, dog hookworm, hydra, and soybean rust. We demonstrate that recombination signal sequences (RSSs) were derived from terminal inverted repeats of an ancient Transib transposon. Furthermore, the critical DDE catalytic triad of RAG1 is shared with the Transib transposase as part of conserved motifs. We also studied several divergent proteins encoded by the sea urchin and lancelet genomes that are 25%-30% identical to the RAG1 N-terminal domain and the RAG1 core. Our results provide the first direct evidence linking RAG1 and RSSs to a specific superfamily of DNA transposons and indicate that the V(D)J machinery evolved from transposons. We propose that only the RAG1 core was derived from the Transib transposase, whereas the N-terminal domain was assembled from separate proteins of unknown function that may still be active in sea urchin, lancelet, hydra, and starlet sea anemone. We also suggest that the RAG2 protein was not encoded by ancient Transib transposons but emerged in jawed vertebrates as a counterpart of RAG1 necessary for the V(D)J recombination reaction.
Assuntos
Anemone/genética , Elementos de DNA Transponíveis/genética , Genes RAG-1 , Genoma , Hydra/genética , Recombinação Genética , Ouriços-do-Mar/genética , VDJ Recombinases/metabolismo , Aedes/genética , Animais , Drosophila/genética , Proteínas de Homeodomínio/genética , HumanosRESUMO
Short INterspersed Elements (SINEs) are non-autonomous retrotransposons, usually between 100 and 500 base pairs (bp) in length, which are ubiquitous components of eukaryotic genomes. Their activity, distribution, and evolution can be highly informative on genomic structure and evolutionary processes. To determine recent activity, we amplified more than one hundred SINE1 loci in a panel of 43 M. domestica individuals derived from five diverse geographic locations. The SINE1 family has expanded recently enough that many loci were polymorphic, and the SINE1 insertion-based genetic distances among populations reflected geographic distance. Genome-wide comparisons of SINE1 densities and GC content revealed that high SINE1 density is associated with high GC content in a few long and many short spans. Young SINE1s, whether fixed or polymorphic, showed an unbiased GC content preference for insertion, indicating that the GC preference accumulates over long time periods, possibly in periodic bursts. SINE1 evolution is thus broadly similar to human Alu evolution, although it has an independent origin. High GC content adjacent to SINE1s is strongly correlated with bias towards higher AT to GC substitutions and lower GC to AT substitutions. This is consistent with biased gene conversion, and also indicates that like chickens, but unlike eutherian mammals, GC content heterogeneity (isochore structure) is reinforced by substitution processes in the M. domestica genome. Nevertheless, both high and low GC content regions are apparently headed towards lower GC content equilibria, possibly due to a relative shift to lower recombination rates in the recent Monodelphis ancestral lineage. Like eutherians, metatherian (marsupial) mammals have evolved high CpG substitution rates, but this is apparently a convergence in process rather than a shared ancestral state.
Assuntos
Evolução Molecular , Genoma/genética , Gambás/genética , Elementos Nucleotídeos Curtos e Dispersos/genética , Animais , Composição de Bases/genética , Cromossomos de Mamíferos/genética , Geografia , Filogenia , Polimorfismo Genético , Dinâmica Populacional , Fatores de Tempo , Cromossomo X/genéticaRESUMO
Genetic variability of the compound interrupted microsatellite DXS1238, in intron 44 of the dystrophin gene, provides evidence for a complex structure of the ancestral population that led to the emergence of modern humans. We sequenced DXS1238 in 600 X-chromosomes from all over the world. Forty four percent of African-specific chromosomes belong to the ancestral lineage that did not participate in the out-of-Africa expansion and subsequent colonization of other continents. Based on the coalescence analysis these lineages separated from those that contributed to the out-of-Africa expansion 366 +/- 136 thousands years ago (Kya). Independently, the analysis of the variance in the repeat length and of the decay of the ancestral alleles of the two DXS1238 repeats, GT and GA, dates this separation at more than 200 Kya. This suggests a complex demographic history and genetic structure of the African melting pot that led to the emergence of modern humans and their out-of-Africa migration. The subsequent subdivisions of human populations among different continents appear to be preceded by even more structured population history within Africa itself, which resulted from a restricted gene flow between lineages allowing for genetic differences to accumulate. If the transition to modern humans occurred during that time, it necessarily follows that genes associated with this transformation spread between subpopulations via gene flow. Otherwise, in spite of subsequent anatomical variation, Homo sapiens as a species could have emerged in Africa already between 300 and 200 Kya, i.e. before the mitochondrial DNA and well before the Y-chromosome most recent common ancestors.