Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Nat Commun ; 8: 14049, 2017 01 16.
Artículo en Inglés | MEDLINE | ID: mdl-28091601

RESUMEN

Characterizing the transcriptome of individual cells is fundamental to understanding complex biological systems. We describe a droplet-based system that enables 3' mRNA counting of tens of thousands of single cells per sample. Cell encapsulation, of up to 8 samples at a time, takes place in ∼6 min, with ∼50% cell capture efficiency. To demonstrate the system's technical performance, we collected transcriptome data from ∼250k single cells across 29 samples. We validated the sensitivity of the system and its ability to detect rare populations using cell lines and synthetic RNAs. We profiled 68k peripheral blood mononuclear cells to demonstrate the system's ability to characterize large immune populations. Finally, we used sequence variation in the transcriptome data to determine host and donor chimerism at single-cell resolution from bone marrow mononuclear cells isolated from transplant patients.


Asunto(s)
Leucocitos Mononucleares/metabolismo , Transcriptoma , Línea Celular , Femenino , Humanos , Leucocitos Mononucleares/química , Masculino , ARN Mensajero/genética , ARN Mensajero/metabolismo , Análisis de la Célula Individual
2.
PLoS One ; 10(10): e0141836, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26510133

RESUMEN

TAR DNA-binding protein 43 (TDP-43) is normally a nuclear RNA-binding protein that exhibits a range of functions including regulation of alternative splicing, RNA trafficking, and RNA stability. However, in amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration with TDP-43 inclusions (FTLD-TDP), TDP-43 is abnormally phosphorylated, ubiquitinated, and cleaved, and is mislocalized to the cytoplasm where it forms distinctive aggregates. We previously developed a mouse model expressing human TDP-43 with a mutation in its nuclear localization signal (ΔNLS-hTDP-43) so that the protein preferentially localizes to the cytoplasm. These mice did not exhibit a significant number of cytoplasmic aggregates, but did display dramatic changes in gene expression as measured by microarray, suggesting that cytoplasmic TDP-43 may be associated with a toxic gain-of-function. Here, we analyze new RNA-sequencing data from the ΔNLS-hTDP-43 mouse model, together with published RNA-sequencing data obtained previously from TDP-43 antisense oligonucleotide (ASO) knockdown mice to investigate further the dysregulation of gene expression in the ΔNLS model. This analysis reveals that the transcriptomic effects of the overexpression of the ΔNLS-hTDP-43 transgene are likely due to a gain of cytoplasmic function. Moreover, cytoplasmic TDP-43 expression alters transcripts that regulate chromatin assembly, the nucleolus, lysosomal function, and histone 3' untranslated region (UTR) processing. These transcriptomic alterations correlate with observed histologic abnormalities in heterochromatin structure and nuclear size in transgenic mouse and human brains.


Asunto(s)
Cromatina/genética , Proteínas de Unión al ADN/genética , Histonas/genética , Transcriptoma , Regiones no Traducidas 3' , Animales , Núcleo Celular/metabolismo , Cromatina/metabolismo , Biología Computacional/métodos , Citoplasma/metabolismo , Perfilación de la Expresión Génica , Técnicas de Silenciamiento del Gen , Histonas/metabolismo , Humanos , Ratones , Ratones Transgénicos , Señales de Localización Nuclear/genética , Empalme del ARN , Reproducibilidad de los Resultados , Eliminación de Secuencia
3.
Fetal Diagn Ther ; 36(4): 282-6, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25228026

RESUMEN

OBJECTIVE: To develop a microarray-based method for noninvasive prenatal testing (NIPT) and compare it with next-generation sequencing. METHODS: Maternal plasma from 878 pregnant women, including 187 trisomy cases (18 trisomy 13, 37 trisomy 18, 132 trisomy 21), was evaluated for trisomy risk. Targeted chromosomes were analyzed using Digital Analysis of Selected Regions (DANSR™) assays. DANSR products were subsequently divided between two DNA quantification methods: microarrays and next-generation sequencing. For both microarray and sequencing methodologies, the Fetal-Fraction Optimized Risk of Trisomy Evaluation (FORTE™) algorithm was used to determine trisomy risk, assay variability across samples, and compute fetal fraction variability within samples. RESULTS: NIPT using microarrays provided faster and more accurate cell-free DNA (cfDNA) measurements than sequencing. The assay variability, a measure of variance of chromosomal cfDNA counts, was lower for microarrays than for sequencing, 0.051 versus 0.099 (p < 0.0001). Analysis time using microarrays was faster, 7.5 versus 56 h for sequencing. Additionally, fetal fraction precision was improved 1.6-fold by assaying more polymorphic sites with microarrays (p < 0.0001). Microarrays correctly classified all trisomy and nontrisomy cases. CONCLUSIONS: NIPT using microarrays delivers more accurate cfDNA analysis than next-generation sequencing and can be performed in less time.


Asunto(s)
Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Diagnóstico Prenatal/métodos , Trisomía/diagnóstico , Adulto , Aneuploidia , ADN/sangre , Femenino , Humanos , Embarazo , Análisis de Secuencia de ADN , Trisomía/genética
4.
Methods ; 67(1): 28-35, 2014 May 01.
Artículo en Inglés | MEDLINE | ID: mdl-24145223

RESUMEN

Recent advances in high-throughput sequencing allow researchers to examine the transcriptome in more detail than ever before. Using a method known as high-throughput small RNA-sequencing, we can now profile the expression of small regulatory RNAs such as microRNAs and small interfering RNAs (siRNAs) with a great deal of sensitivity. However, there are many other types of small RNAs (<50nt) present in the cell, including fragments derived from snoRNAs (small nucleolar RNAs), snRNAs (small nuclear RNAs), scRNAs (small cytoplasmic RNAs), tRNAs (transfer RNAs), and transposon-derived RNAs. Here, we present a user's guide for CoRAL (Classification of RNAs by Analysis of Length), a computational method for discriminating between different classes of RNA using high-throughput small RNA-sequencing data. Not only can CoRAL distinguish between RNA classes with high accuracy, but it also uses features that are relevant to small RNA biogenesis pathways. By doing so, CoRAL can give biologists a glimpse into the characteristics of different RNA processing pathways and how these might differ between tissue types, biological conditions, or even different species. CoRAL is available at http://wanglab.pcbi.upenn.edu/coral/.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento , ARN Pequeño no Traducido/clasificación , Análisis de Secuencia de ARN , Algoritmos , Animales , Inteligencia Artificial , Secuencia de Bases , Árboles de Decisión , Entropía , Humanos , Secuencias Invertidas Repetidas , Datos de Secuencia Molecular , Conformación de Ácido Nucleico , Procesamiento Postranscripcional del ARN , ARN Pequeño no Traducido/genética
5.
RNA ; 19(12): 1684-92, 2013 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-24149843

RESUMEN

RNA is often altered post-transcriptionally by the covalent modification of particular nucleotides; these modifications are known to modulate the structure and activity of their host RNAs. The recent discovery that an RNA methyl-6 adenosine demethylase (FTO) is a risk gene in obesity has brought to light the significance of RNA modifications to human biology. These noncanonical nucleotides, when converted to cDNA in the course of RNA sequencing, can produce sequence patterns that are distinguishable from simple base-calling errors. To determine whether these modifications can be detected in RNA sequencing data, we developed a method that can not only locate these modifications transcriptome-wide with single nucleotide resolution, but can also differentiate between different classes of modifications. Using small RNA-seq data we were able to detect 92% of all known human tRNA modification sites that are predicted to affect RT activity. We also found that different modifications produce distinct patterns of cDNA sequence, allowing us to differentiate between two classes of adenosine and two classes of guanine modifications with 98% and 79% accuracy, respectively. To show the robustness of this method to sample preparation and sequencing methods, as well as to organismal diversity, we applied it to a publicly available yeast data set and achieved similar levels of accuracy. We also experimentally validated two novel and one known 3-methylcytosine (3mC) sites predicted by HAMR in human tRNAs. Researchers can now use our method to identify and characterize RNA modifications using only RNA-seq data, both retrospectively and when asking questions specifically about modified RNA.


Asunto(s)
Anotación de Secuencia Molecular/métodos , Procesamiento Postranscripcional del ARN , ARN de Transferencia/genética , Programas Informáticos , Femenino , Células HEK293 , Humanos , Masculino , ARN/genética , ARN/metabolismo , ARN de Transferencia/metabolismo , Saccharomyces cerevisiae/genética , Alineación de Secuencia , Análisis de Secuencia de ARN
6.
PLoS One ; 8(10): e74578, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-24098339

RESUMEN

Several recent gene expression studies identified hundreds of genes that are correlated with age in brain and other tissues in human. However, these studies used linear models of age correlation, which are not well equipped to model abrupt changes associated with particular ages. We developed a computational algorithm for age estimation in which the expression of each gene is treated as a dichotomized biomarker for whether the subject is older or younger than a particular age. In addition, for each age-informative gene our algorithm identifies the age threshold with the most drastic change in expression level, which allows us to associate genes with particular age periods. Analysis of human aging brain expression datasets from three frontal cortex regions showed that different pathways undergo transitions at different ages, and the distribution of pathways and age thresholds varies across brain regions. Our study reveals age-correlated expression changes at particular age points and allows one to estimate the age of an individual with better accuracy than previously published methods.


Asunto(s)
Envejecimiento/genética , Encéfalo/metabolismo , Biología Computacional/métodos , Dinámicas no Lineales , Transcriptoma/fisiología , Adulto , Anciano , Envejecimiento/metabolismo , Teorema de Bayes , Regulación hacia Abajo/fisiología , Femenino , Humanos , Masculino , Persona de Mediana Edad , Mapas de Interacción de Proteínas/genética
7.
Genes Dev ; 27(12): 1406-20, 2013 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-23756653

RESUMEN

Cellular senescence is accompanied by dramatic changes in chromatin structure and gene expression. Using Saccharomyces cerevisiae mutants lacking telomerase (tlc1Δ) to model senescence, we found that with critical telomere shortening, the telomere-binding protein Rap1 (repressor activator protein 1) relocalizes to the upstream promoter regions of hundreds of new target genes. The set of new Rap1 targets at senescence (NRTS) is preferentially activated at senescence, and experimental manipulations of Rap1 levels indicate that it contributes directly to NRTS activation. A notable subset of NRTS includes the core histone-encoding genes; we found that Rap1 contributes to their repression and that histone protein levels decline at senescence. Rap1 and histones also display a target site-specific antagonism that leads to diminished nucleosome occupancy at the promoters of up-regulated NRTS. This antagonism apparently impacts the rate of senescence because underexpression of Rap1 or overexpression of the core histones delays senescence. Rap1 relocalization is not a simple consequence of lost telomere-binding sites, but rather depends on the Mec1 checkpoint kinase. Rap1 relocalization is thus a novel mechanism connecting DNA damage responses (DDRs) at telomeres to global changes in chromatin and gene expression while driving the pace of senescence.


Asunto(s)
Cromatina/metabolismo , Regulación Fúngica de la Expresión Génica , Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Unión a Telómeros/metabolismo , Factores de Transcripción/metabolismo , Histonas/genética , Viabilidad Microbiana , Transporte de Proteínas , Saccharomyces cerevisiae/citología , Proteínas de Saccharomyces cerevisiae/genética , Complejo Shelterina , Telómero/genética , Telómero/metabolismo , Proteínas de Unión a Telómeros/genética , Factores de Transcripción/genética , Transcriptoma
8.
Nucleic Acids Res ; 41(14): e137, 2013 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-23700308

RESUMEN

The surprising observation that virtually the entire human genome is transcribed means we know little about the function of many emerging classes of RNAs, except their astounding diversities. Traditional RNA function prediction methods rely on sequence or alignment information, which are limited in their abilities to classify the various collections of non-coding RNAs (ncRNAs). To address this, we developed Classification of RNAs by Analysis of Length (CoRAL), a machine learning-based approach for classification of RNA molecules. CoRAL uses biologically interpretable features including fragment length and cleavage specificity to distinguish between different ncRNA populations. We evaluated CoRAL using genome-wide small RNA sequencing data sets from four human tissue types and were able to classify six different types of RNAs with ∼80% cross-validation accuracy. Analysis by CoRAL revealed that microRNAs, small nucleolar and transposon-derived RNAs are highly discernible and consistent across all human tissue types assessed, whereas long intergenic ncRNAs, small cytoplasmic RNAs and small nuclear RNAs show less consistent patterns. The ability to reliably annotate loci across tissue types demonstrates the potential of CoRAL to characterize ncRNAs using small RNA sequencing data in less well-characterized organisms.


Asunto(s)
Inteligencia Artificial , ARN Pequeño no Traducido/clasificación , Análisis de Secuencia de ARN , Algoritmos , Clasificación/métodos , Humanos , ARN Pequeño no Traducido/química
9.
Cell Rep ; 1(1): 69-82, 2012 Jan 26.
Artículo en Inglés | MEDLINE | ID: mdl-22832108

RESUMEN

The secondary structure of RNA is necessary for its maturation, regulation, processing, and function. However, the global influence of RNA folding in eukaryotes is still unclear. Here, we use a high-throughput, sequencing-based, structure-mapping approach to identify the paired (double-stranded RNA [dsRNA]) and unpaired (single-stranded RNA [ssRNA]) components of the Drosophila melanogaster and Caenorhabditis elegans transcriptomes, which allows us to identify conserved features of RNA secondary structure in metazoans. From this analysis, we find that ssRNAs and dsRNAs are significantly correlated with specific epigenetic modifications. Additionally, we find key structural patterns across protein-coding transcripts that indicate that RNA folding demarcates regions of protein translation and likely affects microRNA-mediated regulation of mRNAs in animals. Finally, we identify and characterize 546 mRNAs whose folding pattern is significantly correlated between these metazoans, suggesting that their structure has some function. Overall, our findings provide a global assessment of RNA folding in animals.


Asunto(s)
Caenorhabditis elegans/genética , Drosophila melanogaster/genética , Conformación de Ácido Nucleico , ARN/química , Animales , Emparejamiento Base/genética , Secuencia de Bases , Cromosomas/genética , Secuencia Conservada , Epigénesis Genética , Genoma/genética , MicroARNs/metabolismo , Datos de Secuencia Molecular , Biosíntesis de Proteínas/genética , ARN/genética , ARN Bicatenario/química , ARN Bicatenario/genética , ARN de Helminto/química , ARN de Helminto/genética , ARN Mensajero/química , Transcriptoma/genética
10.
Methods ; 57(1): 3-10, 2012 May.
Artículo en Inglés | MEDLINE | ID: mdl-22652626

RESUMEN

Bioinformatic approaches to the identification of genomic sequences having G-quadruplex forming potential (QFP) has enabled important tests of the structure of these sequences in vitro and of their behavior under conditions where the formation or function of G-quadruplexes is modulated in vivo. Several similar approaches to identifying intramolecular QFP (i.e. forming among G-runs on one strand of DNA) have been developed previously, but none appears to perfectly predict G-quadruplex formation. Here we describe a new approach, which complements and differs from prior approaches in that it identifies motifs containing G-runs on both strands of duplex DNA that could contribute to G-quadruplex structures. We call these motifs duplex-derived interstrand QFP (ddiQFP), and illustrate their potential applications by describing their genomic distribution and an example of their correspondence to loci targeted by a G-quadruplex-unwinding DNA helicase in yeast.


Asunto(s)
ADN Helicasas/química , ADN/química , G-Cuádruplex , Dicroismo Circular , Biología Computacional/métodos , Genómica , Guanina/química , Conformación de Ácido Nucleico , Telómero/química
11.
Nucleic Acids Res ; 40(Web Server issue): W59-64, 2012 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-22492627

RESUMEN

RNA secondary structure is required for the proper regulation of the cellular transcriptome. This is because the functionality, processing, localization and stability of RNAs are all dependent on the folding of these molecules into intricate structures through specific base pairing interactions encoded in their primary nucleotide sequences. Thus, as the number of RNA sequencing (RNA-seq) data sets and the variety of protocols for this technology grow rapidly, it is becoming increasingly pertinent to develop tools that can analyze and visualize this sequence data in the context of RNA secondary structure. Here, we present Sequencing Annotation and Visualization of RNA structures (SAVoR), a web server, which seamlessly links RNA structure predictions with sequencing data and genomic annotations to produce highly informative and annotated models of RNA secondary structure. SAVoR accepts read alignment data from RNA-seq experiments and computes a series of per-base values such as read abundance and sequence variant frequency. These values can then be visualized on a customizable secondary structure model. SAVoR is freely available at http://tesla.pcbi.upenn.edu/savor.


Asunto(s)
ARN/química , Programas Informáticos , Internet , Modelos Moleculares , Anotación de Secuencia Molecular , Conformación de Ácido Nucleico , Análisis de Secuencia de ARN
12.
PLoS Genet ; 6(9): e1001141, 2010 Sep 30.
Artículo en Inglés | MEDLINE | ID: mdl-20941385

RESUMEN

The functional structure of all biologically active molecules is dependent on intra- and inter-molecular interactions. This is especially evident for RNA molecules whose functionality, maturation, and regulation require formation of correct secondary structure through encoded base-pairing interactions. Unfortunately, intra- and inter-molecular base-pairing information is lacking for most RNAs. Here, we marry classical nuclease-based structure mapping techniques with high-throughput sequencing technology to interrogate all base-paired RNA in Arabidopsis thaliana and identify ∼200 new small (sm)RNA-producing substrates of RNA-DEPENDENT RNA POLYMERASE6. Our comprehensive analysis of paired RNAs reveals conserved functionality within introns and both 5' and 3' untranslated regions (UTRs) of mRNAs, as well as a novel population of functional RNAs, many of which are the precursors of smRNAs. Finally, we identify intra-molecular base-pairing interactions to produce a genome-wide collection of RNA secondary structure models. Although our methodology reveals the pairing status of RNA molecules in the absence of cellular proteins, previous studies have demonstrated that structural information obtained for RNAs in solution accurately reflects their structure in ribonucleoprotein complexes. Furthermore, our identification of RNA-DEPENDENT RNA POLYMERASE6 substrates and conserved functional RNA domains within introns and both 5' and 3' untranslated regions (UTRs) of mRNAs using this approach strongly suggests that RNA molecules are correctly folded into their secondary structure in solution. Overall, our findings highlight the importance of base-paired RNAs in eukaryotes and present an approach that should be widely applicable for the analysis of this key structural feature of RNA.


Asunto(s)
Arabidopsis/genética , Emparejamiento Base/genética , Genoma de Planta/genética , ARN Bicatenario/genética , ARN de Planta/genética , Análisis de Secuencia de ARN/métodos , Proteínas de Arabidopsis/metabolismo , Secuencia Conservada/genética , Perfilación de la Expresión Génica , Regulación de la Expresión Génica de las Plantas , Genómica , Intrones/genética , ARN Mensajero/química , ARN Mensajero/genética , ARN Mensajero/metabolismo , ARN de Planta/química , ARN Nuclear Pequeño/química , ARN Nuclear Pequeño/genética , ARN Polimerasa Dependiente del ARN/metabolismo , Especificidad por Sustrato , Regiones no Traducidas/genética
13.
Nucleic Acids Res ; 38(4): 1114-22, 2010 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-19966276

RESUMEN

The human Werner and Bloom syndromes (WS and BS) are caused by deficiencies in the WRN and BLM RecQ helicases, respectively. WRN, BLM and their Saccharomyces cerevisiae homologue Sgs1, are particularly active in vitro in unwinding G-quadruplex DNA (G4-DNA), a family of non-canonical nucleic acid structures formed by certain G-rich sequences. Recently, mRNA levels from loci containing potential G-quadruplex-forming sequences (PQS) were found to be preferentially altered in sgs1Delta mutants, suggesting that G4-DNA targeting by Sgs1 directly affects gene expression. Here, we extend these findings to human cells. Using microarrays to measure mRNAs obtained from human fibroblasts deficient for various RecQ family helicases, we observe significant associations between loci that are upregulated in WS or BS cells and loci that have PQS. No such PQS associations were observed for control expression datasets, however. Furthermore, upregulated genes in WS and BS showed no or dramatically reduced associations with sequences similar to PQS but that have considerably reduced potential to form intramolecular G4-DNA. These findings indicate that, like Sgs1, WRN and BLM can regulate transcription globally by targeting G4-DNA.


Asunto(s)
Síndrome de Bloom/genética , ADN/química , G-Cuádruplex , Regulación de la Expresión Génica , RecQ Helicasas/deficiencia , Síndrome de Werner/genética , Secuencia de Bases , Síndrome de Bloom/metabolismo , Línea Celular , Fibroblastos/metabolismo , Perfilación de la Expresión Génica , Humanos , Regulación hacia Arriba , Síndrome de Werner/metabolismo
14.
Methods Mol Biol ; 608: 39-50, 2010.
Artículo en Inglés | MEDLINE | ID: mdl-20012414

RESUMEN

Sequences with the potential to form intramolecular G-quadruplexes (G4-structures) are found in highly nonrandom distributions in the genomes of diverse organisms. These sequences are associated with nucleic acid metabolic processes ranging from transcription and translation to recombination and telomere function. Here we review different computational methods for identifying potential G4-forming sequences and provide protocols for their implementation. We also discuss methods for assessing the significance and specificity of associations between the sequences and different biological functions.


Asunto(s)
Biología Computacional/métodos , G-Cuádruplex , Guanina/química , Cartilla de ADN/química , Humanos
15.
J Comput Biol ; 16(10): 1429-44, 2009 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-19803737

RESUMEN

Gene duplication has long been recognized as a major force in genome evolution and has recently been recognized as an important source of individual variation. For many years, the origin of functional gene duplicates was assumed to be whole or partial genome duplication events, but recently retrotransposition has also been shown to contribute new functional protein coding genes and siRNA's. In this study, we utilize pseudogenes to recreate more complete gene family histories, and compare the rates of RNA and DNA-mediated duplication and new functional gene formation in five mammalian genomes. We find that RNA-mediated duplication occurs at a much higher and more variable rate than DNA-mediated duplication, and gives rise to many more duplicated sequences over time. We show that, while the chance of RNA-mediated duplicates becoming functional is much lower than that of their DNA-mediated counterparts, the higher rate of retrotransposition leads to nearly equal contributions of new genes by each mechanism. We also find that functional RNA-mediated duplicates are closer to neighboring genes than non-functional RNA-mediated copies, consistent with co-option of regulatory elements at the site of insertion. Overall, new genes derived from DNA and RNA-mediated duplication mechanisms are under similar levels of purifying selective pressure, but have broadly different functions. RNA-mediated duplication gives rise to a diversity of genes but is dominated by the highly expressed genes of RNA metabolic pathways. DNA-mediated duplication can copy regulatory material along with the protein coding region of the gene and often gives rise to classes of genes whose function are dependent on complex regulatory information. This mechanistic difference may in part explain why we find that mammalian protein families tend to evolve by either one mechanism or the other, but rarely by both. Supplementary Material has been provided (see online Supplementary Material at www.liebertonline.com ).


Asunto(s)
Evolución Biológica , ADN/genética , Genes Duplicados , ARN/genética , Animales , Genoma Humano , Humanos , Modelos Genéticos , Familia de Multigenes , Filogenia , Sintenía
16.
J Comput Biol ; 16(9): 1253-66, 2009 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-19772436

RESUMEN

Here we identify duplicated genes in five mammalian genomes and classify these duplicates based on the mechanisms by which they were generated. Retrotransposition accounts for at least half of all predicted duplicate genes in these genomes, with tandem and interspersed DNA-mediated duplicates comprising the other half. Estimation of the evolutionary rates in each class revealed greater rate asymmetry between retrotransposed and interspersed DNA duplicate pairs than between tandem duplicates, suggesting that retrotransposed and interspersed DNA duplicates are diverging more quickly. In an attempt to understand the basis of this asymmetry, we identified disruption of flanking DNA as an indicator of new duplicate fate-loss of local synteny accelerates the asymmetry of divergence of interspersed DNA duplicates. We also show that intact retrogenes are enriched in intergenic regions and indel purified regions of the human genome. Moreover, intact retrogenes closest to annotated genes show the greatest levels of purifying selective pressure. Together, these findings suggest that the differential evolution of duplicate genes may be significantly influenced by changes in local genome architecture.


Asunto(s)
ADN/genética , Duplicación de Gen , Genoma , Animales , Perros , Evolución Molecular , Humanos , Ratones , Modelos Genéticos , Pan troglodytes , Ratas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA