Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 50
Filtrar
1.
Genome Res ; 34(3): 498-513, 2024 Apr 25.
Artigo em Inglês | MEDLINE | ID: mdl-38508693

RESUMO

Hydractinia is a colonial marine hydroid that shows remarkable biological properties, including the capacity to regenerate its entire body throughout its lifetime, a process made possible by its adult migratory stem cells, known as i-cells. Here, we provide an in-depth characterization of the genomic structure and gene content of two Hydractinia species, Hydractinia symbiolongicarpus and Hydractinia echinata, placing them in a comparative evolutionary framework with other cnidarian genomes. We also generated and annotated a single-cell transcriptomic atlas for adult male H. symbiolongicarpus and identified cell-type markers for all major cell types, including key i-cell markers. Orthology analyses based on the markers revealed that Hydractinia's i-cells are highly enriched in genes that are widely shared amongst animals, a striking finding given that Hydractinia has a higher proportion of phylum-specific genes than any of the other 41 animals in our orthology analysis. These results indicate that Hydractinia's stem cells and early progenitor cells may use a toolkit shared with all animals, making it a promising model organism for future exploration of stem cell biology and regenerative medicine. The genomic and transcriptomic resources for Hydractinia presented here will enable further studies of their regenerative capacity, colonial morphology, and ability to distinguish self from nonself.


Assuntos
Genoma , Hidrozoários , Animais , Hidrozoários/genética , Evolução Molecular , Transcriptoma , Células-Tronco/metabolismo , Masculino , Filogenia , Análise de Célula Única/métodos
2.
bioRxiv ; 2023 Aug 27.
Artigo em Inglês | MEDLINE | ID: mdl-37786714

RESUMO

Hydractinia is a colonial marine hydroid that exhibits remarkable biological properties, including the capacity to regenerate its entire body throughout its lifetime, a process made possible by its adult migratory stem cells, known as i-cells. Here, we provide an in-depth characterization of the genomic structure and gene content of two Hydractinia species, H. symbiolongicarpus and H. echinata, placing them in a comparative evolutionary framework with other cnidarian genomes. We also generated and annotated a single-cell transcriptomic atlas for adult male H. symbiolongicarpus and identified cell type markers for all major cell types, including key i-cell markers. Orthology analyses based on the markers revealed that Hydractinia's i-cells are highly enriched in genes that are widely shared amongst animals, a striking finding given that Hydractinia has a higher proportion of phylum-specific genes than any of the other 41 animals in our orthology analysis. These results indicate that Hydractinia's stem cells and early progenitor cells may use a toolkit shared with all animals, making it a promising model organism for future exploration of stem cell biology and regenerative medicine. The genomic and transcriptomic resources for Hydractinia presented here will enable further studies of their regenerative capacity, colonial morphology, and ability to distinguish self from non-self.

3.
Front Immunol ; 14: 1142634, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37483601

RESUMO

Introduction: The majority of studies on oxidative phosphorylation in immune cells have been performed in mouse models, necessitating human translation. To understand the impact of oxidative phosphorylation (OXPHOS) deficiency on human immunity, we studied children with primary mitochondrial disease (MtD). Methods: scRNAseq analysis of peripheral blood mononuclear cells was performed on matched children with MtD (N = 4) and controls (N = 4). To define B cell function we performed phage display immunoprecipitation sequencing on a cohort of children with MtD (N = 19) and controls (N = 16). Results: Via scRNAseq, we found marked reductions in select populations involved in the humoral immune response, especially antigen presenting cells, B cell and plasma populations, with sparing of T cell populations. MTRNR2L8, a marker of bioenergetic stress, was significantly elevated in populations that were most depleted. mir4485, a miRNA contained in the intron of MTRNR2L8, was co-expressed. Knockdown studies of mir4485 demonstrated its role in promoting survival by modulating apoptosis. To determine the functional consequences of our findings on humoral immunity, we studied the antiviral antibody repertoire in children with MtD and controls using phage display and immunoprecipitation sequencing. Despite similar viral exposomes, MtD displayed antiviral antibodies with less robust fold changes and limited polyclonality. Discussion: Overall, we show that children with MtD display perturbations in the B cell repertoire which may impact humoral immunity and the ability to clear viral infections.


Assuntos
Leucócitos Mononucleares , Fosforilação Oxidativa , Camundongos , Animais , Criança , Humanos , Imunidade Humoral , Linfócitos B , Antivirais
4.
Res Sq ; 2023 May 10.
Artigo em Inglês | MEDLINE | ID: mdl-37215014

RESUMO

Modulation of metabolic flux through pyruvate dehydrogenase complex (PDC) plays an important role in T cell activation and differentiation. PDC sits at the transition between glycolysis and the tricarboxylic acid cycle and is a major producer of acetyl-CoA, marking it as a potential metabolic and epigenetic node To understand the role of pyruvate dehydrogenase complex in T cell differentiation, we generated mice deficient in T cell pyruvate dehydrogenase E1A (Pdha) subunit using a CD4-cre recombinase-based strategy. Herein, we show that genetic ablation of PDC activity in T cells (TPdh-/-) leads to marked perturbations in glycolysis, the tricarboxylic acid cycle, and OXPHOS. TPdh-/- T cells became dependent upon substrate level phosphorylation via glycolysis, secondary to depressed OXPHOS. Due to the block of PDC activity, histone acetylation was also reduced, including H3K27, a critical site for CD8+ TM differentiation. Transcriptional and functional profiling revealed abnormal CD8+ TM differentiation in vitro. Collectively, our data indicate that PDC integrates the metabolome and epigenome in CD8+ memory T cell differentiation. Targeting this metabolic and epigenetic node can have widespread ramifications on cellular function.

5.
Res Sq ; 2023 Jan 27.
Artigo em Inglês | MEDLINE | ID: mdl-36789409

RESUMO

Background: Modulation of metabolic flux through pyruvate dehydrogenase complex (PDC) plays an important role in T cell activation and differentiation. PDC sits at the transition between glycolysis and the tricarboxylic acid cycle and is a major producer of acetyl-CoA, marking it as a potential metabolic and epigenetic node. Methods: To understand the role of pyruvate dehydrogenase complex in T cell differentiation, we generated mice deficient in T cell pyruvate dehydrogenase E1A (Pdha) subunit using a CD4-cre recombinase-based strategy. To control for the contribution of exogenous metabolites in vivo, we conducted our T cell functional studies in vitro. T cells were differentiated into memory and effector T cells using standardized protocols. Cells were analyzed using stable isotopic tracing studies, metabolomics, RNAseq, ATACseq, ChIPseq and histone proteomics. Results: Herein, we show that genetic ablation of PDC activity in T cells (TPdh-/-) leads to marked perturbations in glycolysis, the tricarboxylic acid cycle, and OXPHOS. Due to depressed OXPHOS, TPdh-/-T cells became dependent upon substrate level phosphorylation via glycolysis. Due to the block of PDC activity, histone acetylation was reduced, as were most other types of post translational modifications. Transcriptional and functional profiling revealed abnormal CD8+ memory T cell differentiation in vitro. Conclusions: Collectively, our data indicate that PDC integrates the metabolome and epigenome in memory T cell differentiation. Targeting this metabolic and epigenetic node can have widespread ramifications on cellular function.

6.
Genome Res ; 33(2): 283-298, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36639202

RESUMO

The epithelial and interstitial stem cells of the freshwater polyp Hydra are the best-characterized stem cell systems in any cnidarian, providing valuable insight into cell type evolution and the origin of stemness in animals. However, little is known about the transcriptional regulatory mechanisms that determine how these stem cells are maintained and how they give rise to their diverse differentiated progeny. To address such questions, a thorough understanding of transcriptional regulation in Hydra is needed. To this end, we generated extensive new resources for characterizing transcriptional regulation in Hydra, including new genome assemblies for Hydra oligactis and the AEP strain of Hydra vulgaris, an updated whole-animal single-cell RNA-seq atlas, and genome-wide maps of chromatin interactions, chromatin accessibility, sequence conservation, and histone modifications. These data revealed the existence of large kilobase-scale chromatin interaction domains in the Hydra genome that contain transcriptionally coregulated genes. We also uncovered the transcriptomic profiles of two previously molecularly uncharacterized cell types: isorhiza-type nematocytes and somatic gonad ectoderm. Finally, we identified novel candidate regulators of cell type-specific transcription, several of which have likely been conserved at least since the divergence of Hydra and the jellyfish Clytia hemisphaerica more than 400 million years ago.


Assuntos
Hydra , Animais , Hydra/genética , Hydra/metabolismo , Diferenciação Celular , Cromatina/metabolismo , Cromossomos , Epigênese Genética
7.
Am J Hum Genet ; 110(1): 3-12, 2023 01 05.
Artigo em Inglês | MEDLINE | ID: mdl-36608682

RESUMO

Although genomic research has predominantly relied on phenotypic ascertainment of individuals affected with heritable disease, the falling costs of sequencing allow consideration of genomic ascertainment and reverse phenotyping (the ascertainment of individuals with specific genomic variants and subsequent evaluation of physical characteristics). In this research modality, the scientific question is inverted: investigators gather individuals with a genomic variant and test the hypothesis that there is an associated phenotype via targeted phenotypic evaluations. Genomic ascertainment research is thus a model of predictive genomic medicine and genomic screening. Here, we provide our experience implementing this research method. We describe the infrastructure we developed to perform reverse phenotyping studies, including aggregating a super-cohort of sequenced individuals who consented to recontact for genomic ascertainment research. We assessed 13 studies completed at the National Institutes of Health (NIH) that piloted our reverse phenotyping approach. The studies can be broadly categorized as (1) facilitating novel genotype-disease associations, (2) expanding the phenotypic spectra, or (3) demonstrating ex vivo functional mechanisms of disease. We highlight three examples of reverse phenotyping studies in detail and describe how using a targeted reverse phenotyping approach (as opposed to phenotypic ascertainment or clinical informatics approaches) was crucial to the conclusions reached. Finally, we propose a framework and address challenges to building collaborative genomic ascertainment research programs at other institutions. Our goal is for more researchers to take advantage of this approach, which will expand our understanding of the predictive capability of genomic medicine and increase the opportunity to mitigate genomic disease.


Assuntos
Genoma , Informática Médica , Fenótipo , Genótipo , Genômica/métodos
8.
Mol Biol Evol ; 38(10): 4628-4633, 2021 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-34048573

RESUMO

To address the void in the availability of high-quality proteomic data traversing the animal tree, we have implemented a pipeline for generating de novo assemblies based on publicly available data from the NCBI Sequence Read Archive, yielding a comprehensive collection of proteomes from 100 species spanning 21 animal phyla. We have also created the Animal Proteome Database (AniProtDB), a resource providing open access to this collection of high-quality metazoan proteomes, along with information on predicted proteins and protein domains for each taxonomic classification and the ability to perform sequence similarity searches against all proteomes generated using this pipeline. This solution vastly increases the utility of these data by removing the barrier to access for research groups who do not have the expertise or resources to generate these data themselves and enables the use of data from nontraditional research organisms that have the potential to address key questions in biomedicine.


Assuntos
Proteoma , Proteômica , Animais , Bases de Dados Factuais , Genômica , Análise de Sequência
9.
Mol Ther Methods Clin Dev ; 20: 247-257, 2021 Mar 12.
Artigo em Inglês | MEDLINE | ID: mdl-33473358

RESUMO

Adeno-associated viral (AAV) vectors have emerged as the preferred platform for in vivo gene transfer because of their combined efficacy and safety. However, insertional mutagenesis with the subsequent development of hepatocellular carcinomas (HCCs) has been recurrently noted in newborn mice treated with high doses of AAV, and more recently, the association of wild-type AAV integrations in a subset of human HCCs has been documented. Here, we address, in a comprehensive, prospective study, the long-term risk of tumorigenicity in young adult mice following delivery of single-stranded AAVs targeting liver. HCC incidence in mice treated with therapeutic and reporter AAVs was low, in contrast to what has been previously documented in mice treated as newborns with higher doses of AAV. Specifically, HCCs developed in 6 out 76 of AAV-treated mice, and a pathogenic integration of AAV was found in only one tumor. Also, no evidence of liver tumorigenesis was found in juvenile AAV-treated mucopolysaccharidosis type VI (MPS VI) cats followed as long as 8 years after vector administration. Together, our results support the low risk of tumorigenesis associated with AAV-mediated gene transfer targeting juvenile/young adult livers, although constant monitoring of subjects enrolled in AAV clinical trial is advisable.

10.
Biol Methods Protoc ; 5(1): bpz019, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-31984226

RESUMO

Single-cell RNA sequencing (scRNA-seq) of human primary tissues is a rapidly emerging tool for investigating human health and disease at the molecular level. However, optimal processing of solid tissues presents a number of technical and logistical challenges, especially for tissues that are only available at autopsy, which includes pancreatic islets, a tissue that is highly relevant to diabetes. To assess the possible effects of different sample preparation protocols on fresh islet samples, we performed a detailed comparison of scRNA-seq data generated with islets isolated from a human donor but processed according to four treatment strategies, including fixation and cryopreservation. We found significant and reproducible differences in the proportion of cell types identified, and more minor effects on cell-specific patterns of gene expression. Fresh islets from a second donor confirmed gene expression signatures of alpha and beta subclusters. These findings may well apply to other tissues, emphasizing the need for careful consideration when choosing processing methods, comparing results between different studies, and/or interpreting data in the context of multiple cell types from preserved tissue.

11.
Transl Behav Med ; 10(2): 441-450, 2020 05 20.
Artigo em Inglês | MEDLINE | ID: mdl-31505002

RESUMO

How individuals perceive uncertainties in sequencing results may affect their clinical utility. The purpose of this study was to explore perceptions of uncertainties in carrier results and how they relate to psychological well-being and health behavior. Post-reproductive adults (N = 462) were randomized to receive carrier results from sequencing through either a web platform or a genetic counselor. On average, participants received two results. Group differences in affective, evaluative, and clinical uncertainties were assessed from baseline to 1 and 6 months; associations with test-specific distress and communication of results were assessed at 6 months. Reductions in affective uncertainty (∆x̅ = 0.78, 95% CI: 0.53, 1.02) and evaluative uncertainty (∆x̅ = 0.69, 95% CI: 0.51, 0.87) followed receipt of results regardless of randomization arm at 1 month. Participants in the web platform arm reported greater clinical uncertainty than those in the genetic counselor arm at 1 and 6 months; this was corroborated by the 1,230 questions asked of the genetic counselor and residual questions reported by those randomized to the web platform. Evaluative uncertainty was associated with a lower likelihood of communicating results to health care providers. Clinical uncertainty was associated with a lower likelihood of communicating results to children. Learning one's carrier results may reduce perceptions of uncertainties, though web-based return may lead to less reduction in clinical uncertainty in the short term. These findings warrant reinforcement of clinical implications to minimize residual questions and promote appropriate health behavior (communicating results to at-risk relatives in the case of carrier results), especially when testing alternative delivery models.


Assuntos
Tomada de Decisão Clínica , Exoma , Adulto , Criança , Comunicação , Feminino , Humanos , Masculino , Percepção , Incerteza
12.
Sci Adv ; 5(6): eaav0547, 2019 06.
Artigo em Inglês | MEDLINE | ID: mdl-31249862

RESUMO

For over a thousand years, the common goldfish (Carassius auratus) was raised throughout Asia for food and as an ornamental pet. As a very close relative of the common carp (Cyprinus carpio), goldfish share the recent genome duplication that occurred approximately 14 million years ago in their common ancestor. The combination of centuries of breeding and a wide array of interesting body morphologies provides an exciting opportunity to link genotype to phenotype and to understand the dynamics of genome evolution and speciation. We generated a high-quality draft sequence and gene annotations of a "Wakin" goldfish using 71X PacBio long reads. The two subgenomes in goldfish retained extensive synteny and collinearity between goldfish and zebrafish. However, genes were lost quickly after the carp whole-genome duplication, and the expression of 30% of the retained duplicated gene diverged substantially across seven tissues sampled. Loss of sequence identity and/or exons determined the divergence of the expression levels across all tissues, while loss of conserved noncoding elements determined expression variance between different tissues. This assembly provides an important resource for comparative genomics and understanding the causes of goldfish variants.


Assuntos
Duplicação Gênica/genética , Genoma/genética , Carpa Dourada/genética , Animais , Ásia , Carpas/genética , Evolução Molecular , Éxons/genética , Genômica/métodos , Genótipo , Fenótipo , Peixe-Zebra/genética
13.
Mol Cell Biochem ; 442(1-2): 29-38, 2018 May.
Artigo em Inglês | MEDLINE | ID: mdl-28913673

RESUMO

The ADAM (a disintegrin and metalloprotease) protein family uniquely exhibits both catalytic and adhesive properties. In the well-defined process of ectodomain shedding, ADAMs transform latent, cell-bound substrates into soluble, biologically active derivatives to regulate a spectrum of normal and pathological processes. In contrast, the integrin ligand properties of ADAMs are not fully understood. Emerging models posit that ADAM-integrin interactions regulate shedding activity by localizing or sequestering the ADAM sheddase. Interestingly, 8 of the 21 human ADAMs are predicted to be catalytically inactive. Unlike their catalytically active counterparts, integrin recognition of these "dead" enzymes has not been largely reported. The present study delineates the integrin ligand properties of a group of non-catalytic ADAMs. Here we report that human ADAM11, ADAM23, and ADAM29 selectively support integrin α4-dependent cell adhesion. This is the first demonstration that the disintegrin-like domains of multiple catalytically inactive ADAMs are ligands for a select subset of integrin receptors that also recognize catalytically active ADAMs.


Assuntos
Proteínas ADAM/metabolismo , Integrina alfa4/metabolismo , Proteínas ADAM/genética , Animais , Células CHO , Adesão Celular/fisiologia , Cricetulus , Humanos , Integrina alfa4/genética , Células Jurkat , Células K562 , Ligantes
14.
Cell Syst ; 4(1): 31-45.e6, 2017 01 25.
Artigo em Inglês | MEDLINE | ID: mdl-27916600

RESUMO

It is unclear how standing genetic variation affects the prognosis of prostate cancer patients. To provide one controlled answer to this problem, we crossed a dominant, penetrant mouse model of prostate cancer to Diversity Outbred mice, a collection of animals that carries over 40 million SNPs. Integration of disease phenotype and SNP variation data in 493 F1 males identified a metastasis modifier locus on Chromosome 8 (LOD = 8.42); further analysis identified the genes Rwdd4, Cenpu, and Casp3 as functional effectors of this locus. Accordingly, analysis of over 5,300 prostate cancer patient samples revealed correlations between the presence of genetic variants at these loci, their expression levels, cancer aggressiveness, and patient survival. We also observed that ectopic overexpression of RWDD4 and CENPU increased the aggressiveness of two human prostate cancer cell lines. In aggregate, our approach demonstrates how well-characterized genetic variation in mice can be harnessed in conjunction with systems genetics approaches to identify and characterize germline modifiers of human disease processes.


Assuntos
Mapeamento Cromossômico/métodos , Neoplasias da Próstata/genética , Animais , Caspase 3/genética , Proteínas de Ciclo Celular/genética , Proteínas de Ciclo Celular/fisiologia , Camundongos de Cruzamento Colaborativo/genética , Modelos Animais de Doenças , Genética Populacional/métodos , Estudo de Associação Genômica Ampla , Células Germinativas/patologia , Mutação em Linhagem Germinativa/genética , Humanos , Masculino , Camundongos , Herança Multifatorial/genética , Metástase Neoplásica/genética , Processos Neoplásicos , Fenótipo , Polimorfismo de Nucleotídeo Único , Neoplasias da Próstata/metabolismo , Locos de Características Quantitativas
16.
Nucleic Acids Res ; 44(D1): D822-6, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26438539

RESUMO

CRISPRz (http://research.nhgri.nih.gov/CRISPRz/) is a database of CRISPR/Cas9 target sequences that have been experimentally validated in zebrafish. Programmable RNA-guided CRISPR/Cas9 has recently emerged as a simple and efficient genome editing method in various cell types and organisms, including zebrafish. Because the technique is so easy and efficient in zebrafish, the most valuable asset is no longer a mutated fish (which has distribution challenges), but rather a CRISPR/Cas9 target sequence to the gene confirmed to have high mutagenic efficiency. With a highly active CRISPR target, a mutant fish can be quickly replicated in any genetic background anywhere in the world. However, sgRNA's vary widely in their activity and models for predicting target activity are imperfect. Thus, it is very useful to collect in one place validated CRISPR target sequences with their relative mutagenic activities. A researcher could then select a target of interest in the database with an expected activity. Here, we report the development of CRISPRz, a database of validated zebrafish CRISPR target sites collected from published sources, as well as from our own in-house large-scale mutagenesis project. CRISPRz can be searched using multiple inputs such as ZFIN IDs, accession number, UniGene ID, or gene symbols from zebrafish, human and mouse.


Assuntos
Sistemas CRISPR-Cas , Bases de Dados Genéticas , RNA , Peixe-Zebra/genética , Animais , Marcação de Genes , Humanos , Camundongos , Mutagênese , Peixe-Zebra/embriologia
17.
BMC Cancer ; 14: 884, 2014 Nov 26.
Artigo em Inglês | MEDLINE | ID: mdl-25427824

RESUMO

BACKGROUND: Endometrial cancer (EC) is the 8th leading cause of cancer death amongst American women. Most ECs are endometrioid, serous, or clear cell carcinomas, or an admixture of histologies. Serous and clear ECs are clinically aggressive tumors for which alternative therapeutic approaches are needed. The purpose of this study was to search for somatic mutations in the tyrosine kinome of serous and clear cell ECs, because mutated kinases can point to potential therapeutic targets. METHODS: In a mutation discovery screen, we PCR amplified and Sanger sequenced the exons encoding the catalytic domains of 86 tyrosine kinases from 24 serous, 11 clear cell, and 5 mixed histology ECs. For somatically mutated genes, we next sequenced the remaining coding exons from the 40 discovery screen tumors and sequenced all coding exons from another 72 ECs (10 clear cell, 21 serous, 41 endometrioid). We assessed the copy number of mutated kinases in this cohort of 112 tumors using quantitative real time PCR, and we used immunoblotting to measure expression of these kinases in endometrial cancer cell lines. RESULTS: Overall, we identified somatic mutations in TNK2 (tyrosine kinase non-receptor, 2) and DDR1 (discoidin domain receptor tyrosine kinase 1) in 5.3% (6 of 112) and 2.7% (3 of 112) of ECs. Copy number gains of TNK2 and DDR1 were identified in another 4.5% and 0.9% of 112 cases respectively. Immunoblotting confirmed TNK2 and DDR1 expression in endometrial cancer cell lines. Three of five missense mutations in TNK2 and one of two missense mutations in DDR1 are predicted to impact protein function by two or more in silico algorithms. The TNK2(P761Rfs*72) frameshift mutation was recurrent in EC, and the DDR1(R570Q) missense mutation was recurrent across tumor types. CONCLUSIONS: This is the first study to systematically search for mutations in the tyrosine kinome in clear cell endometrial tumors. Our findings indicate that high-frequency somatic mutations in the catalytic domains of the tyrosine kinome are rare in clear cell ECs. We uncovered ten new mutations in TNK2 and DDR1 within serous and endometrioid ECs, thus providing novel insights into the mutation spectrum of each gene in EC.


Assuntos
Adenocarcinoma de Células Claras/genética , Cistadenocarcinoma Seroso/genética , Neoplasias do Endométrio/genética , Mutação , Proteínas Tirosina Quinases/genética , Receptores Proteína Tirosina Quinases/genética , Adenocarcinoma de Células Claras/patologia , Linhagem Celular Tumoral , Cistadenocarcinoma Seroso/patologia , Variações do Número de Cópias de DNA , Análise Mutacional de DNA , DNA Polimerase II/genética , Receptor com Domínio Discoidina 1 , Neoplasias do Endométrio/patologia , Feminino , Humanos , Instabilidade de Microssatélites , Gradação de Tumores , Proteínas de Ligação a Poli-ADP-Ribose , Domínios e Motivos de Interação entre Proteínas/genética
18.
BMC Bioinformatics ; 15: 329, 2014 Oct 13.
Artigo em Inglês | MEDLINE | ID: mdl-25311246

RESUMO

BACKGROUND: Quantification of a transcriptional profile is a useful way to evaluate the activity of a cell at a given point in time. Although RNA-Seq has revolutionized transcriptional profiling, the costs of RNA-Seq are still significantly higher than microarrays, and often the depth of data delivered from RNA-Seq is in excess of what is needed for simple transcript quantification. Digital Gene Expression (DGE) is a cost-effective, sequence-based approach for simple transcript quantification: by sequencing one read per molecule of RNA, this technique can be used to efficiently count transcripts while obviating the need for transcript-length normalization and reducing the total numbers of reads necessary for accurate quantification. Here, we present trieFinder, a program specifically designed to rapidly map, parse, and annotate DGE tags of various lengths against cDNA and/or genomic sequence databases. RESULTS: The trieFinder algorithm maps DGE tags in a two-step process. First, it scans FASTA files of RefSeq, UniGene, and genomic DNA sequences to create a database of all tags that can be derived from a predefined restriction site. Next, it compares the experimental DGE tags to this tag database, taking advantage of the fact that the tags are stored as a prefix tree, or "trie", which allows for linear-time searches for exact matches. DGE tags with mismatches are analyzed by recursive calls in the data structure. We find that, in terms of alignment speed, the mapping functionality of trieFinder compares favorably with Bowtie. CONCLUSIONS: trieFinder can quickly provide the user an annotation of the DGE tags from three sources simultaneously, simplifying transcript quantification and novel transcript detection, delivering the data in a simple parsed format, obviating the need to post-process the alignment results. trieFinder is available at http://research.nhgri.nih.gov/software/trieFinder/.


Assuntos
Algoritmos , Perfilação da Expressão Gênica/métodos , Anotação de Sequência Molecular/métodos , DNA Complementar/genética , Bases de Dados Genéticas , Genômica , Análise de Sequência de DNA , Software
19.
BMC Genomics ; 15: 316, 2014 Apr 28.
Artigo em Inglês | MEDLINE | ID: mdl-24773765

RESUMO

BACKGROUND: Mnemiopsis leidyi is a ctenophore native to the coastal waters of the western Atlantic Ocean. A number of studies on Mnemiopsis have led to a better understanding of many key biological processes, and these studies have contributed to the emergence of Mnemiopsis as an important model for evolutionary and developmental studies. Recently, we sequenced, assembled, annotated, and performed a preliminary analysis on the 150-megabase genome of the ctenophore, Mnemiopsis. This sequencing effort has produced the first set of whole-genome sequencing data on any ctenophore species and is amongst the first wave of projects to sequence an animal genome de novo solely using next-generation sequencing technologies. DESCRIPTION: The Mnemiopsis Genome Project Portal (http://research.nhgri.nih.gov/mnemiopsis/) is intended both as a resource for obtaining genomic information on Mnemiopsis through an intuitive and easy-to-use interface and as a model for developing customized Web portals that enable access to genomic data. The scope of data available through this Portal goes well beyond the sequence data available through GenBank, providing key biological information not available elsewhere, such as pathway and protein domain analyses; it also features a customized genome browser for data visualization. CONCLUSIONS: We expect that the availability of these data will allow investigators to advance their own research projects aimed at understanding phylogenetic diversity and the evolution of proteins that play a fundamental role in metazoan development. The overall approach taken in the development of this Web site can serve as a viable model for disseminating data from whole-genome sequencing projects, framed in a way that best-serves the specific needs of the scientific community.


Assuntos
Ctenóforos/genética , Genoma , Internet , Animais
20.
Nucleic Acids Res ; 42(7): 4257-69, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24464997

RESUMO

Retroviruses integrate into the host genome in patterns specific to each virus. Understanding the causes of these patterns can provide insight into viral integration mechanisms, pathology and genome evolution, and is critical to the development of safe gene therapy vectors. We generated murine leukemia virus integrations in human HepG2 and K562 cells and subjected them to second-generation sequencing, using a DNA barcoding technique that allowed us to quantify independent integration events. We characterized >3,700,000 unique integration events in two ENCODE-characterized cell lines. We find that integrations were most highly enriched in a subset of strong enhancers and active promoters. In both cell types, approximately half the integrations were found in <2% of the genome, demonstrating genomic influences even narrower than previously believed. The integration pattern of murine leukemia virus appears to be largely driven by regions that have high enrichment for multiple marks of active chromatin; the combination of histone marks present was sufficient to explain why some strong enhancers were more prone to integration than others. The approach we used is applicable to analyzing the integration pattern of any exogenous element and could be a valuable preclinical screen to evaluate the safety of gene therapy vectors.


Assuntos
Sítios de Ligação Microbiológicos , Elementos Facilitadores Genéticos , Vírus da Leucemia Murina de Moloney/fisiologia , Regiões Promotoras Genéticas , Integração Viral , Linhagem Celular Tumoral , Humanos , Células K562
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...