Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 141
Filtrar
Mais filtros










Filtros aplicados
Base de dados
Intervalo de ano de publicação
1.
J Virol ; 96(5): e0157321, 2022 03 09.
Artigo em Inglês | MEDLINE | ID: mdl-34985997

RESUMO

Bracoviruses (BVs) are endogenized nudiviruses in parasitoid wasps of the microgastroid complex (family Braconidae). Microgastroid wasps have coopted nudivirus genes to produce replication-defective virions that females use to transfer virulence genes to parasitized hosts. The microgastroid complex further consists of six subfamilies and ∼50,000 species but current understanding of BV gene inventories and organization primarily derives from analysis of two wasp species in the subfamily Microgastrinae (Microplitis demolitor and Cotesia congregata) that produce M. demolitor BV (MdBV) and C. congregata BV (CcBV). Notably, several genomic features of MdBV and CcBV remain conserved since divergence of M. demolitor and C. congregata ∼53 million years ago (MYA). However, it is unknown whether these conserved traits more broadly reflect BV evolution, because no complete genomes exist for any microgastroid wasps outside the Microgastrinae. In this regard, the subfamily Cheloninae is of greatest interest because it diverged earliest from the Microgastrinae (∼85 MYA) after endogenization of the nudivirus ancestor. Here, we present the complete genome of Chelonus insularis, which is an egg-larval parasitoid in the Cheloninae that produces C. insularis BV (CinsBV). We report that the inventory of nudivirus genes in C. insularis is conserved but are dissimilarly organized compared to M. demolitor and C. congregata. Reciprocally, CinsBV proviral segments share organizational features with MdBV and CcBV but virulence gene inventories exhibit almost no overlap. Altogether, our results point to the functional importance of a conserved inventory of nudivirus genes and a dynamic set of virulence genes for the successful parasitism of hosts. Our results also suggest organizational features previously identified in MdBV and CcBV are likely not essential for BV virion formation. IMPORTANCE Bracoviruses are a remarkable example of virus endogenization, because large sets of genes from a nudivirus ancestor continue to produce virions that thousands of wasp species rely upon to parasitize hosts. Understanding how these genes interact and have been coopted by wasps for novel functions is of broad interest in the study of virus evolution. This work characterizes bracovirus genome components in the parasitoid wasp Chelonus insularis, which together with existing wasp genomes captures a large portion of the diversity among wasp species that produce bracoviruses. Results provide new information about how bracovirus genome components are organized in different wasps while also providing additional insights on key features required for function.


Assuntos
Genoma de Inseto , Polydnaviridae , Vespas , Animais , Feminino , Componentes Genômicos/genética , Genoma de Inseto/genética , Nudiviridae/genética , Polydnaviridae/genética , Polydnaviridae/patogenicidade , Provírus/genética , Fatores de Virulência/genética , Vespas/classificação , Vespas/genética , Vespas/virologia
2.
STAR Protoc ; 2(3): 100799, 2021 09 17.
Artigo em Inglês | MEDLINE | ID: mdl-34527956

RESUMO

Reorganization of topologically associated domain (TAD) is considered to be a novel mechanism for cell fate transitions. Here, we present a protocol to manipulate TAD via abscisic acid (ABA)-dependent genome linking. We use this protocol to merge two adjacent TADs and evaluate the influence on cell fate transitions. The advantages are that the manipulation does not change the genome and is reversible by withdrawing ABA. The major challenge is how to select linking loci for efficient TAD reorganization. For complete details on the use and execution of this protocol, please refer to Wang et al. (2021).


Assuntos
Diferenciação Celular/genética , Técnicas Citológicas/métodos , Componentes Genômicos , Genômica/métodos , Ácido Abscísico/farmacologia , Animais , Linhagem Celular , Genoma/efeitos dos fármacos , Genoma/genética , Componentes Genômicos/efeitos dos fármacos , Componentes Genômicos/genética , Humanos , Camundongos
3.
Nucleic Acids Res ; 49(10): 5705-5725, 2021 06 04.
Artigo em Inglês | MEDLINE | ID: mdl-33978759

RESUMO

Gene expression is controlled by regulatory elements within accessible chromatin. Although most regulatory elements are cell type-specific, a subset is accessible in nearly all the 517 human and 94 mouse cell and tissue types assayed by the ENCODE consortium. We systematically analyzed 9000 human and 8000 mouse ubiquitously-accessible candidate cis-regulatory elements (cCREs) with promoter-like signatures (PLSs) from ENCODE, which we denote ubi-PLSs. These are more CpG-rich than non-ubi-PLSs and correspond to genes with ubiquitously high transcription, including a majority of cell-essential genes. ubi-PLSs are enriched with motifs of ubiquitously-expressed transcription factors and preferentially bound by transcriptional cofactors regulating ubiquitously-expressed genes. They are highly conserved between human and mouse at the synteny level but exhibit frequent turnover of motif sites; accordingly, ubi-PLSs show increased variation at their centers compared with flanking regions among the ∼186 thousand human genomes sequenced by the TOPMed project. Finally, ubi-PLSs are enriched in genes implicated in Mendelian diseases, especially diseases broadly impacting most cell types, such as deficiencies in mitochondrial functions. Thus, a set of roughly 9000 mammalian promoters are actively maintained in an accessible state across cell types by a distinct set of transcription factors and cofactors to ensure the transcriptional programs of cell-essential genes.


Assuntos
Cromatina/metabolismo , Epigênese Genética , Regulação da Expressão Gênica/genética , Sequências Reguladoras de Ácido Nucleico , Fatores de Transcrição/metabolismo , Transcriptoma/genética , Motivos de Aminoácidos , Animais , Composição de Bases , Cromatina/genética , Metilação de DNA , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Bases de Dados Genéticas , Epigenômica , Ontologia Genética , Genes Essenciais , Componentes Genômicos , Genoma Humano , Humanos , Camundongos , Proteínas de Neoplasias/genética , Proteínas de Neoplasias/metabolismo , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Especificidade de Órgãos/genética , Regiões Promotoras Genéticas , Proteínas Repressoras/genética , Proteínas Repressoras/metabolismo , TATA Box , Fatores de Transcrição/genética
4.
Commun Biol ; 4(1): 283, 2021 03 05.
Artigo em Inglês | MEDLINE | ID: mdl-33674780

RESUMO

Much like humans, chimpanzees occupy diverse habitats and exhibit extensive behavioural variability. However, chimpanzees are recognized as a discontinuous species, with four subspecies separated by historical geographic barriers. Nevertheless, their range-wide degree of genetic connectivity remains poorly resolved, mainly due to sampling limitations. By analyzing a geographically comprehensive sample set amplified at microsatellite markers that inform recent population history, we found that isolation by distance explains most of the range-wide genetic structure of chimpanzees. Furthermore, we did not identify spatial discontinuities corresponding with the recognized subspecies, suggesting that some of the subspecies-delineating geographic barriers were recently permeable to gene flow. Substantial range-wide genetic connectivity is consistent with the hypothesis that behavioural flexibility is a salient driver of chimpanzee responses to changing environmental conditions. Finally, our observation of strong local differentiation associated with recent anthropogenic pressures portends future loss of critical genetic diversity if habitat fragmentation and population isolation continue unabated.


Assuntos
Comportamento Animal , Evolução Molecular , Variação Genética , Componentes Genômicos , Repetições de Microssatélites , Pan troglodytes/genética , Migração Animal , Animais , Ecossistema , Interação Gene-Ambiente , Genética Populacional , Pan troglodytes/psicologia , Filogenia , Especificidade da Espécie
5.
Methods Mol Biol ; 2219: 289-305, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33074549

RESUMO

Miniaturization, which is a common feature in animals, is particularly manifest in meiofauna-animals sharing peculiar phenotypic features that evolved as adaptations to the highly specialized aquatic interstitial habitat. While revealing much about the extreme phyletic diversity of meiofauna, the genome structure of meiofaunal species could also characterize the phenotype of ancestral states as well as explain the origin and evolution of miniaturization. Here, we present a practical bioinformatics tutorial for genome assembly, genome comparison, and characterization of Hox clusters in meiofaunal species.


Assuntos
Genômica/métodos , Animais , Biodiversidade , Evolução Biológica , Biologia Computacional/métodos , Ecossistema , Genes Homeobox , Genoma , Componentes Genômicos , Filogenia , Software
6.
Mol Cell ; 78(3): 539-553.e8, 2020 05 07.
Artigo em Inglês | MEDLINE | ID: mdl-32213323

RESUMO

Whereas folding of genomes at the large scale of epigenomic compartments and topologically associating domains (TADs) is now relatively well understood, how chromatin is folded at finer scales remains largely unexplored in mammals. Here, we overcome some limitations of conventional 3C-based methods by using high-resolution Micro-C to probe links between 3D genome organization and transcriptional regulation in mouse stem cells. Combinatorial binding of transcription factors, cofactors, and chromatin modifiers spatially segregates TAD regions into various finer-scale structures with distinct regulatory features including stripes, dots, and domains linking promoters-to-promoters (P-P) or enhancers-to-promoters (E-P) and bundle contacts between Polycomb regions. E-P stripes extending from the edge of domains predominantly link co-expressed loci, often in the absence of CTCF and cohesin occupancy. Acute inhibition of transcription disrupts these gene-related folding features without altering higher-order chromatin structures. Our study uncovers previously obscured finer-scale genome organization, establishing functional links between chromatin folding and gene regulation.


Assuntos
Montagem e Desmontagem da Cromatina/genética , Cromatina/química , Cromatina/metabolismo , Transcrição Gênica , Animais , Fator de Ligação a CCCTC/genética , Cromatina/genética , DNA Polimerase II/genética , DNA Polimerase II/metabolismo , Células-Tronco Embrionárias/fisiologia , Elementos Facilitadores Genéticos , Regulação da Expressão Gênica , Componentes Genômicos , Camundongos , Regiões Promotoras Genéticas , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo
7.
Planta ; 251(3): 72, 2020 Feb 28.
Artigo em Inglês | MEDLINE | ID: mdl-32112137

RESUMO

MAIN CONCLUSION: This study provides broad insight into the chloroplast genomes of the subfamily Monsteroideae. The identified polymorphic regions may be suitable for designing unique and robust molecular markers for phylogenetic inference. Monsteroideae is the third largest subfamily (comprises 369 species) and one of the early diverging lineages of the monocot plant family Araceae. The phylogeny of this important subfamily is not well resolved at the species level due to scarcity of genomic resources and suitable molecular markers. Here, we report annotated chloroplast genome sequences of four Monsteroideae species: Spathiphyllum patulinervum, Stenospermation multiovulatum, Monstera adansonii, and Rhaphidophora amplissima. The quadripartite chloroplast genomes (size range 163,335-164,751 bp) consist of a pair of inverted repeats (25,270-25,931 bp), separating a small single copy region (21,448-22,346 bp) from a large single copy region (89,714-91,841 bp). The genomes contain 114 unique genes, including four rRNA genes, 80 protein-coding genes, and 30 tRNA genes. Gene features, amino acid frequencies, codon usage, GC contents, oligonucleotide repeats, and inverted repeats dynamics exhibit similarities among the four genomes. Higher rate of synonymous substitutions was observed as compared to non-synonymous substitutions in 76 protein-coding genes. Positive selection was observed in seven protein-coding genes, including psbK, ndhK, ndhD, rbcL, accD, rps8, and ycf2. Our included species of Araceae showed the monophyly in Monsteroideae and other subfamilies. We report 30 suitable polymorphic regions. The polymorphic regions identified here might be suitable for designing unique and robust markers for inferring the phylogeny and phylogeography among closely related species within the genus Spathiphyllum and among distantly related species within the subfamily Monsteroideae. The chloroplast genomes presented here are a valuable contribution towards understanding the molecular evolutionary dynamics in the family Araceae.


Assuntos
Araceae/classificação , Araceae/genética , Evolução Molecular , Genes de Plantas/genética , Genoma de Cloroplastos/genética , Filogenia , Composição de Bases , Sequência de Bases , Cloroplastos/genética , Uso do Códon , DNA de Plantas , Genes de RNAr , Componentes Genômicos , Tamanho do Genoma , Anotação de Sequência Molecular , Fases de Leitura Aberta , Folhas de Planta/genética , Proteínas de Plantas/genética , Polimorfismo Genético , RNA de Transferência/genética
8.
BMC Genomics ; 21(1): 36, 2020 Jan 13.
Artigo em Inglês | MEDLINE | ID: mdl-31931702

RESUMO

BACKGROUND: Genome wide association studies (GWAS) on residual feed intake (RFI) and its component traits including daily dry matter intake (DMI), average daily gain (ADG), and metabolic body weight (MWT) were conducted in a population of 7573 animals from multiple beef cattle breeds based on 7,853,211 imputed whole genome sequence variants. The GWAS results were used to elucidate genetic architectures of the feed efficiency related traits in beef cattle. RESULTS: The DNA variant allele substitution effects approximated a bell-shaped distribution for all the traits while the distribution of additive genetic variances explained by single DNA variants followed a scaled inverse chi-squared distribution to a greater extent. With a threshold of P-value < 1.00E-05, 16, 72, 88, and 116 lead DNA variants on multiple chromosomes were significantly associated with RFI, DMI, ADG, and MWT, respectively. In addition, lead DNA variants with potentially large pleiotropic effects on DMI, ADG, and MWT were found on chromosomes 6, 14 and 20. On average, missense, 3'UTR, 5'UTR, and other regulatory region variants exhibited larger allele substitution effects in comparison to other functional classes. Intergenic and intron variants captured smaller proportions of additive genetic variance per DNA variant. Instead 3'UTR and synonymous variants explained a greater amount of genetic variance per DNA variant for all the traits examined while missense, 5'UTR and other regulatory region variants accounted for relatively more additive genetic variance per sequence variant for RFI and ADG, respectively. In total, 25 to 27 enriched cellular and molecular functions were identified with lipid metabolism and carbohydrate metabolism being the most significant for the feed efficiency traits. CONCLUSIONS: RFI is controlled by many DNA variants with relatively small effects whereas DMI, ADG, and MWT are influenced by a few DNA variants with large effects and many DNA variants with small effects. Nucleotide polymorphisms in regulatory region and synonymous functional classes play a more important role per sequence variant in determining variation of the feed efficiency traits. The genetic architecture as revealed by the GWAS of the imputed 7,853,211 DNA variants will improve our understanding on the genetic control of feed efficiency traits in beef cattle.


Assuntos
Estudos de Associação Genética , Componentes Genômicos , Estudo de Associação Genômica Ampla , Locos de Características Quantitativas , Característica Quantitativa Herdável , Animais , Bovinos , Ingestão de Alimentos , Variação Genética , Genômica/métodos , Polimorfismo de Nucleotídeo Único , Sequenciamento Completo do Genoma
9.
Genome Biol Evol ; 11(12): 3424-3435, 2019 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-31742335

RESUMO

Within-species variation in genome size has been documented in many animals and plants. Despite its importance for understanding eukaryotic genome diversity, there is only sparse knowledge about how individual-level processes mediate genome size variation in populations. Here, we study a natural population of the rotifer Brachionus asplanchnoidis whose members differ up to 1.9-fold in diploid genome size, but were still able to interbreed and produce viable offspring. We show that genome size is highly heritable and can be artificially selected up or down, but not below a certain basal diploid genome size for this species. Analyses of segregation patterns in haploid males reveal that large genomic elements (several megabases in size) provide the substrate of genome size variation. These elements, and their segregation patterns, explain the generation of new genome size variants, the short-term evolutionary potential of genome size change in populations, and some seemingly paradoxical patterns, like an increase in genome size variation among highly inbred lines. Our study suggests that a conceptual model involving only two variables, 1) a basal genome size of the population, and 2) a vector containing information on additional elements that may increase genome size in this population (size, number, and meiotic segregation behavior), can effectively address most scenarios of short-term evolutionary change of genome size in a population.


Assuntos
Tamanho do Genoma/genética , Genoma Helmíntico/genética , Rotíferos/genética , Animais , Evolução Molecular , Feminino , Variação Genética , Genética Populacional , Componentes Genômicos/genética , Masculino , Meiose , Rotíferos/citologia
10.
Int J Mol Sci ; 20(5)2019 Mar 09.
Artigo em Inglês | MEDLINE | ID: mdl-30857296

RESUMO

Satellite DNA (satDNA) is the most variable fraction of the eukaryotic genome. Related species share a common ancestral satDNA library and changing of any library component in a particular lineage results in interspecific differences. Although the general developmental trend is clear, our knowledge of the origin and dynamics of satDNAs is still fragmentary. Here, we explore whole genome shotgun Illumina reads using the RepeatExplorer (RE) pipeline to infer satDNA family life stories in the genomes of Chenopodium species. The seven diploids studied represent separate lineages and provide an example of a species complex typical for angiosperms. Application of the RE pipeline allowed by similarity searches a determination of the satDNA family with a basic monomer of ~40 bp and to trace its transformation from the reconstructed ancestral to the species-specific sequences. As a result, three types of satDNA family evolutionary development were distinguished: (i) concerted evolution with mutation and recombination events; (ii) concerted evolution with a trend toward increased complexity and length of the satellite monomer; and (iii) non-concerted evolution, with low levels of homogenization and multidirectional trends. The third type is an example of entire repeatome transformation, thus producing a novel set of satDNA families, and genomes showing non-concerted evolution are proposed as a significant source for genomic diversity.


Assuntos
Chenopodium/genética , DNA de Plantas/genética , DNA Satélite/genética , Diploide , Evolução Molecular , Componentes Genômicos , Genoma de Planta , Sequenciamento de Nucleotídeos em Larga Escala , Filogenia , Análise de Sequência de DNA , Especificidade da Espécie
11.
Genome Biol Evol ; 11(3): 869-882, 2019 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-30830203

RESUMO

Clownfishes are an iconic group of coral reef fishes, especially known for their mutualism with sea anemones. This mutualism is particularly interesting as it likely acted as the key innovation that triggered clownfish adaptive radiation. Indeed, after the acquisition of the mutualism, clownfishes diversified into multiple ecological niches linked with host and habitat use. However, despite the importance of this mutualism, the genetic mechanisms allowing clownfishes to interact with sea anemones are still unclear. Here, we used a comparative genomics and molecular evolutionary analyses to investigate the genetic basis of clownfish mutualism with sea anemones. We assembled and annotated the genome of nine clownfish species and one closely related outgroup. Orthologous genes inferred between these species and additional publicly available teleost genomes resulted in almost 16,000 genes that were tested for positively selected substitutions potentially involved in the adaptation of clownfishes to live in sea anemones. We identified 17 genes with a signal of positive selection at the origin of clownfish radiation. Two of them (Versican core protein and Protein O-GlcNAse) show particularly interesting functions associated with N-acetylated sugars, which are known to be involved in sea anemone discharge of toxins. This study provides the first insights into the genetic mechanisms of clownfish mutualism with sea anemones. Indeed, we identified the first candidate genes likely to be associated with clownfish protection form sea anemones, and thus the evolution of their mutualism. Additionally, the genomic resources acquired represent a valuable resource for further investigation of the genomic basis of clownfish adaptive radiation.


Assuntos
Evolução Biológica , Perciformes/genética , Anêmonas-do-Mar , Simbiose/genética , Animais , Componentes Genômicos , Seleção Genética
12.
Cell ; 176(6): 1502-1515.e10, 2019 03 07.
Artigo em Inglês | MEDLINE | ID: mdl-30799036

RESUMO

Several general principles of global 3D genome organization have recently been established, including non-random positioning of chromosomes and genes in the cell nucleus, distinct chromatin compartments, and topologically associating domains (TADs). However, the extent and nature of cell-to-cell and cell-intrinsic variability in genome architecture are still poorly characterized. Here, we systematically probe heterogeneity in genome organization. High-throughput optical mapping of several hundred intra-chromosomal interactions in individual human fibroblasts demonstrates low association frequencies, which are determined by genomic distance, higher-order chromatin architecture, and chromatin environment. The structure of TADs is variable between individual cells, and inter-TAD associations are common. Furthermore, single-cell analysis reveals independent behavior of individual alleles in single nuclei. Our observations reveal extensive variability and heterogeneity in genome organization at the level of individual alleles and demonstrate the coexistence of a broad spectrum of genome configurations in a cell population.


Assuntos
Montagem e Desmontagem da Cromatina/fisiologia , Cromatina/genética , Componentes Genômicos/fisiologia , Linhagem Celular , Núcleo Celular/genética , Cromossomos , Fibroblastos/fisiologia , Genoma/genética , Componentes Genômicos/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Masculino , Análise de Célula Única
13.
Viruses ; 11(2)2019 02 04.
Artigo em Inglês | MEDLINE | ID: mdl-30720711

RESUMO

Subterranean clover stunt virus (SCSV) is a type species of the genus Nanovirus in the family Nanoviridae. It was the first single-stranded DNA plant virus with a multipartite genome, of which genomic DNA sequences had been determined. All nanoviruses have eight genome components except SCSV, for which homologs of two genome components present in all other nanovirus genomes, DNA-U2 and DNA-U4, were lacking. We analysed archived and more recent samples from SCSV-infected legume plants to verify its genome composition and found the missing genome components. These results indicated that SCSV also has eight genome components and is a typical member of the genus Nanovirus.


Assuntos
DNA Viral/genética , Componentes Genômicos , Genoma Viral , Nanovirus/genética , Evolução Molecular , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA
14.
Hum Immunol ; 79(6): 491-493, 2018 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-29580811

RESUMO

Two novel HLA class II alleles, DRB4*03:01N and DQB1*03:276N, containing large deletions were identified during routine typing. Extraction of DNA encompassing the deletions was carried out with a panel of capture oligonucleotides followed by whole genome amplification. Next generation DNA sequencing was then used to characterize the sequences. DRB4*03:01N has a 16 kilobase pair deletion stretching upstream from intron 2 toward centromeric DRB8. DQB1*03:276N has two deletions separated by 844 nucleotides. The first deletion (3.7 kilobase pairs) is upstream of intron 1 and the second deletion removes 3.3 kilobase pairs further upstream towards centromeric DQA2.


Assuntos
Alelos , Genótipo , Cadeias beta de HLA-DQ/genética , Cadeias HLA-DRB4/genética , Deleção de Sequência/genética , Primers do DNA/genética , Genoma , Componentes Genômicos/genética , Teste de Histocompatibilidade , Humanos , Íntrons/genética , Polimorfismo Genético
15.
Mol Biol Evol ; 35(4): 855-870, 2018 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-29294063

RESUMO

Multicellularity is the premier example of a major evolutionary transition in individuality and was a foundational event in the evolution of macroscopic biodiversity. The volvocine chlorophyte lineage is well suited for studying this process. Extant members span unicellular, simple colonial, and obligate multicellular taxa with germ-soma differentiation. Here, we report the nuclear genome sequence of one of the most morphologically simple organisms in this lineage-the 4-celled colonial Tetrabaena socialis and compare this to the three other complete volvocine nuclear genomes. Using conservative estimates of gene family expansions a minimal set of expanded gene families was identified that associate with the origin of multicellularity. These families are rich in genes related to developmental processes. A subset of these families is lineage specific, which suggests that at a genomic level the evolution of multicellularity also includes lineage-specific molecular developments. Multiple points of evidence associate modifications to the ubiquitin proteasomal pathway (UPP) with the beginning of coloniality. Genes undergoing positive or accelerating selection in the multicellular volvocines were found to be enriched in components of the UPP and gene families gained at the origin of multicellularity include components of the UPP. A defining feature of colonial/multicellular life cycles is the genetic control of cell number. The genomic data presented here, which includes diversification of cell cycle genes and modifications to the UPP, align the genetic components with the evolution of this trait.


Assuntos
Evolução Biológica , Clorófitas/genética , Genes cdc , Componentes Genômicos , Ciclinas/genética , Genes do Retinoblastoma , Família Multigênica , Complexo de Endopeptidases do Proteassoma/genética , Seleção Genética , Transcriptoma , Ubiquitina/genética
16.
Hum Mol Genet ; 26(R2): R208-R215, 2017 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-28977451

RESUMO

It has been known for over a century that chromatin is not randomly distributed within the nucleus. However, the question of how DNA is folded and the influence of such folding on nuclear processes remain topics of intensive current research. A longstanding, unanswered question is whether nuclear organization is simply a reflection of nuclear processes such as transcription and replication, or whether chromatin is folded by independent mechanisms and this per se encodes function? Evidence is emerging that both may be true. Here, using the α-globin gene cluster as an illustrative model, we provide an overview of the most recent insights into the layers of genome organization across different scales and how this relates to gene activity.


Assuntos
Componentes Genômicos/genética , Genoma/genética , Genoma/fisiologia , Animais , Núcleo Celular/genética , Núcleo Celular/fisiologia , Cromatina/genética , Cromatina/fisiologia , DNA/genética , Replicação do DNA/genética , Humanos , Família Multigênica/genética , Conformação de Ácido Nucleico , Transcrição Gênica/genética , Transcrição Gênica/fisiologia , alfa-Globinas/genética
17.
Nucleic Acids Res ; 45(18): 10403-10414, 2017 Oct 13.
Artigo em Inglês | MEDLINE | ID: mdl-28977568

RESUMO

Topologically associated domains (TADs) are 3D genomic structures with high internal interactions that play important roles in genome compaction and gene regulation. Their genomic locations and their association with CCCTC-binding factor (CTCF)-binding sites and transcription start sites (TSSs) were recently reported. However, the relationship between TADs and other genomic elements has not been systematically evaluated. This was addressed in the present study, with a focus on the enrichment of these genomic elements and their ability to predict the TAD boundary region. We found that consensus CTCF-binding sites were strongly associated with TAD boundaries as well as with the transcription factors (TFs) Zinc finger protein (ZNF)143 and Yin Yang (YY)1. TAD boundary-associated genomic elements include DNase I-hypersensitive sites, H3K36 trimethylation, TSSs, RNA polymerase II, and TFs such as Specificity protein 1, ZNF274 and SIX homeobox 5. Computational modeling with these genomic elements suggests that they have distinct roles in TAD boundary formation. We propose a structural model of TAD boundaries based on these findings that provides a basis for studying the mechanism of chromatin structure formation and gene regulation.


Assuntos
Cromatina/genética , Mapeamento Cromossômico/métodos , Biologia Computacional/métodos , Genoma Humano , Elementos Reguladores de Transcrição , Algoritmos , Sítios de Ligação , Cromatina/metabolismo , Bases de Dados Genéticas , Regulação da Expressão Gênica , Componentes Genômicos , Humanos , Regiões Promotoras Genéticas
18.
PLoS Comput Biol ; 13(2): e1005383, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-28192430

RESUMO

Selenocysteine (Sec) is known as the 21st amino acid, a cysteine analogue with selenium replacing sulphur. Sec is inserted co-translationally in a small fraction of proteins called selenoproteins. In selenoprotein genes, the Sec specific tRNA (tRNASec) drives the recoding of highly specific UGA codons from stop signals to Sec. Although found in organisms from the three domains of life, Sec is not universal. Many species are completely devoid of selenoprotein genes and lack the ability to synthesize Sec. Since tRNASec is a key component in selenoprotein biosynthesis, its efficient identification in genomes is instrumental to characterize the utilization of Sec across lineages. Available tRNA prediction methods fail to accurately predict tRNASec, due to its unusual structural fold. Here, we present Secmarker, a method based on manually curated covariance models capturing the specific tRNASec structure in archaea, bacteria and eukaryotes. We exploited the non-universality of Sec to build a proper benchmark set for tRNASec predictions, which is not possible for the predictions of other tRNAs. We show that Secmarker greatly improves the accuracy of previously existing methods constituting a valuable tool to identify tRNASec genes, and to efficiently determine whether a genome contains selenoproteins. We used Secmarker to analyze a large set of fully sequenced genomes, and the results revealed new insights in the biology of tRNASec, led to the discovery of a novel bacterial selenoprotein family, and shed additional light on the phylogenetic distribution of selenoprotein containing genomes. Secmarker is freely accessible for download, or online analysis through a web server at http://secmarker.crg.cat.


Assuntos
Mapeamento Cromossômico/métodos , Marcadores Genéticos/genética , Genoma/genética , Ensaios de Triagem em Larga Escala/métodos , RNA de Transferência Aminoácido-Específico/genética , Aminoacil-RNA de Transferência/genética , Algoritmos , Componentes Genômicos/genética , Selenocisteína
19.
PLoS One ; 11(8): e0161531, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27556533

RESUMO

Obtaining sequence data from historical museum specimens has been a growing research interest, invigorated by next-generation sequencing methods that allow inputs of highly degraded DNA. We applied a target enrichment and next-generation sequencing protocol to generate ultraconserved elements (UCEs) from 51 large carpenter bee specimens (genus Xylocopa), representing 25 species with specimen ages ranging from 2-121 years. We measured the correlation between specimen age and DNA yield (pre- and post-library preparation DNA concentration) and several UCE sequence capture statistics (raw read count, UCE reads on target, UCE mean contig length and UCE locus count) with linear regression models. We performed piecewise regression to test for specific breakpoints in the relationship of specimen age and DNA yield and sequence capture variables. Additionally, we compared UCE data from newer and older specimens of the same species and reconstructed their phylogeny in order to confirm the validity of our data. We recovered 6-972 UCE loci from samples with pre-library DNA concentrations ranging from 0.06-9.8 ng/µL. All investigated DNA yield and sequence capture variables were significantly but only moderately negatively correlated with specimen age. Specimens of age 20 years or less had significantly higher pre- and post-library concentrations, UCE contig lengths, and locus counts compared to specimens older than 20 years. We found breakpoints in our data indicating a decrease of the initial detrimental effect of specimen age on pre- and post-library DNA concentration and UCE contig length starting around 21-39 years after preservation. Our phylogenetic results confirmed the integrity of our data, giving preliminary insights into relationships within Xylocopa. We consider the effect of additional factors not measured in this study on our age-related sequence capture results, such as DNA fragmentation and preservation method, and discuss the promise of the UCE approach for large-scale projects in insect phylogenomics using museum specimens.


Assuntos
Sequência Conservada , Componentes Genômicos , Genoma de Inseto , Insetos/classificação , Insetos/genética , Análise de Sequência de DNA , Animais , Biologia Computacional/métodos , Filogenia
20.
Genet Mol Res ; 15(3)2016 Jul 25.
Artigo em Inglês | MEDLINE | ID: mdl-27525861

RESUMO

Repetitive sequences of variable length are common in almost all eukaryotic genomes, and most of them are presumed to have important biomedical functions and can cause genomic instability. Next-generation sequencing (NGS) technologies provide the possibility of identifying capturing these repetitive sequences directly from the NGS data. In this study, we assessed the performances in identifying capturing repeats of leading assemblers, such as Velvet, SOAPdenovo, SGA, MSR-CA, Bambus2, ALLPATHS-LG, and AByss using three real NGS datasets. Our results indicated that most of them performed poorly in capturing the repeats. Consequently, we proposed a repetitive sequence assembler, named NGSReper, for capturing repeats from NGS data. Simulated datasets were used to validate the feasibility of NGSReper. The results indicate that the completeness of capturing repeat is up to 99%. Cross validation was performed in three real NGS datasets, and extensive comparisons indicate that NGSReper performed best in terms of completeness and accuracy in capturing repeats. In conclusion, NGSReper is an appropriate and suitable tool for capturing repeats directly from NGS data.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequências Repetitivas de Ácido Nucleico/genética , Algoritmos , Componentes Genômicos , Genoma Humano , Humanos , Análise de Sequência de DNA/métodos , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...