Pesquisa | Portal de Pesquisa da BVS

1.

scDecouple: decoupling cellular response from infected proportion bias in scCRISPR-seq.

Meng, Qiuchen; Wei, Lei; Ma, Kun; Shi, Ming; Lin, Xinyi; Ho, Joshua W K; Li, Yinqing; Zhang, Xuegong.

Brief Bioinform ; 25(2)2024 Jan 22.

Artigo em Inglês | MEDLINE | ID: mdl-38324621

RESUMO

Single-cell clustered regularly interspaced short palindromic repeats-sequencing (scCRISPR-seq) is an emerging high-throughput CRISPR screening technology where the true cellular response to perturbation is coupled with infected proportion bias of guide RNAs (gRNAs) across different cell clusters. The mixing of these effects introduces noise into scCRISPR-seq data analysis and thus obstacles to relevant studies. We developed scDecouple to decouple true cellular response of perturbation from the influence of infected proportion bias. scDecouple first models the distribution of gene expression profiles in perturbed cells and then iteratively finds the maximum likelihood of cell cluster proportions as well as the cellular response for each gRNA. We demonstrated its performance in a series of simulation experiments. By applying scDecouple to real scCRISPR-seq data, we found that scDecouple enhances the identification of biologically perturbation-related genes. scDecouple can benefit scCRISPR-seq data analysis, especially in the case of heterogeneous samples or complex gRNA libraries.

Assuntos

Ensaios de Triagem em Larga Escala , RNA Guia de Sistemas CRISPR-Cas

2.

Inferring cell diversity in single cell data using consortium-scale epigenetic data as a biological anchor for cell identity.

Sun, Yuliangzi; Shim, Woo Jun; Shen, Sophie; Sinniah, Enakshi; Pham, Duy; Su, Zezhuo; Mizikovsky, Dalia; White, Melanie D; Ho, Joshua W K; Nguyen, Quan; Bodén, Mikael; Palpant, Nathan J.

Nucleic Acids Res ; 51(11): e62, 2023 06 23.

Artigo em Inglês | MEDLINE | ID: mdl-37125641

RESUMO

Methods for cell clustering and gene expression from single-cell RNA sequencing (scRNA-seq) data are essential for biological interpretation of cell processes. Here, we present TRIAGE-Cluster which uses genome-wide epigenetic data from diverse bio-samples to identify genes demarcating cell diversity in scRNA-seq data. By integrating patterns of repressive chromatin deposited across diverse cell types with weighted density estimation, TRIAGE-Cluster determines cell type clusters in a 2D UMAP space. We then present TRIAGE-ParseR, a machine learning method which evaluates gene expression rank lists to define gene groups governing the identity and function of cell types. We demonstrate the utility of this two-step approach using atlases of in vivo and in vitro cell diversification and organogenesis. We also provide a web accessible dashboard for analysis and download of data and software. Collectively, genome-wide epigenetic repression provides a versatile strategy to define cell diversity and study gene regulation of scRNA-seq data.

Assuntos

Perfilação da Expressão Gênica , Análise de Célula Única , Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Software , Análise por Conglomerados , Epigênese Genética , Algoritmos

3.

Cellular diversity and lineage trajectory: insights from mouse single cell transcriptomes.

Tam, Patrick P L; Ho, Joshua W K.

Development ; 147(2)2020 01 24.

Artigo em Inglês | MEDLINE | ID: mdl-31980483

RESUMO

Single cell RNA-sequencing (scRNA-seq) technology has matured to the point that it is possible to generate large single cell atlases of developing mouse embryos. These atlases allow the dissection of developmental cell lineages and molecular changes during embryogenesis. When coupled with single cell technologies for profiling the chromatin landscape, epigenome, proteome and metabolome, and spatial tissue organisation, these scRNA-seq approaches can now collect a large volume of multi-omic data about mouse embryogenesis. In addition, advances in computational techniques have enabled the inference of developmental lineages of differentiating cells, even without explicitly introduced genetic markers. This Spotlight discusses recent advent of single cell experimental and computational methods, and key insights from applying these methods to the study of mouse embryonic development. We highlight challenges in analysing and interpreting these data to complement and expand our knowledge from traditional developmental biology studies in relation to cell identity, diversity and lineage differentiation.

Assuntos

Linhagem da Célula/genética , Análise de Célula Única , Transcriptoma/genética , Animais , Desenvolvimento Embrionário/genética , Regulação da Expressão Gênica no Desenvolvimento , Camundongos

4.

FlowGrid enables fast clustering of very large single-cell RNA-seq data.

Fang, Xiunan; Ho, Joshua W K.

Bioinformatics ; 38(1): 282-283, 2021 12 22.

Artigo em Inglês | MEDLINE | ID: mdl-34289014

RESUMO

MOTIVATION: Scalable clustering algorithms are needed to analyze millions of cells in single cell RNA-seq (scRNA-seq) data. RESULTS: Here, we present an open source python package called FlowGrid that can integrate into the Scanpy workflow to perform clustering on very large scRNA-seq datasets. FlowGrid implements a fast density-based clustering algorithm originally designed for flow cytometry data analysis. We introduce a new automated parameter tuning procedure, and show that FlowGrid can achieve comparable clustering accuracy as state-of-the-art clustering algorithms but at a substantially reduced run time for very large single cell RNA-seq datasets. For example, FlowGrid can complete a one-hour clustering task for one million cells in about five min. AVAILABILITY AND IMPLEMENTATION: https://github.com/holab-hku/FlowGrid. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Perfilação da Expressão Gênica , Análise da Expressão Gênica de Célula Única , Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Algoritmos , Análise por Conglomerados , Software

5.

Dynamic changes in antibiotic resistance genes and gut microbiota after Helicobacter pylori eradication therapies.

Wang, Lingling; Yao, Haobin; Tong, Teresa; Lau, KamShing; Leung, Suet Yi; Ho, Joshua W K; Leung, Wai K.

Helicobacter ; 27(2): e12871, 2022 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-34969161

RESUMO

BACKGROUND: Short-term antibiotics exposure is associated with alterations in microbiota and antibiotic resistance genes (ARGs) in the human gut. While antibiotics are critical in the successful eradication of Helicobacter pylori, the short-term and long-term impacts on the composition and quantity of antibiotics resistance genes after H. pylori eradication are unclear. This study used whole-genome shotgun metagenomic of stool samples to characterize the gut microbiota and ARGs, before and after H. pylori eradication therapy. RESULTS: Forty-four H. pylori-infected patients were recruited, including 21 treatment naïve patients who received clarithromycin-based triple therapy (CLA group) and 23 patients who failed previous therapies, in which 10 received levofloxacin-based quadruple therapy (LEVO group) and 13 received other combinations (OTHER group). Stool samples were collected at baseline (before current treatment), 6 week and 6 month after eradication therapy. At baseline, there was only a slight difference among the three groups on ARGs and gut microbiota. After eradication therapy, there was a transient but significant increase in gut ARGs 6 week post-therapy, among which the LEVO group had the most significant ARGs alteration compared to other two groups. For treatment naïve patients, those with higher ErmF abundance were prone to fail CLA eradication and gain more ARGs after treatment. For gut microbiota, the bacteria richness decreased at 6 week and there was a significant difference in microbiota community among the three groups at 6 week. CONCLUSIONS: Our findings demonstrated the dynamic alterations in gut microbiota and ARGs induced by different eradication therapies, which could influence the choices of antibiotics in eradication therapy.

Assuntos

Microbioma Gastrointestinal , Infecções por Helicobacter , Helicobacter pylori , Amoxicilina/uso terapêutico , Antibacterianos/farmacologia , Antibacterianos/uso terapêutico , Claritromicina/farmacologia , Claritromicina/uso terapêutico , Resistência Microbiana a Medicamentos , Quimioterapia Combinada , Microbioma Gastrointestinal/genética , Infecções por Helicobacter/microbiologia , Helicobacter pylori/genética , Humanos

6.

Light-focusing human micro-lenses generated from pluripotent stem cells model lens development and drug-induced cataract in vitro.

Murphy, Patricia; Kabir, Md Humayun; Srivastava, Tarini; Mason, Michele E; Dewi, Chitra U; Lim, Seakcheng; Yang, Andrian; Djordjevic, Djordje; Killingsworth, Murray C; Ho, Joshua W K; Harman, David G; O'Connor, Michael D.

Development ; 145(1)2018 01 09.

Artigo em Inglês | MEDLINE | ID: mdl-29217756

RESUMO

Cataracts cause vision loss and blindness by impairing the ability of the ocular lens to focus light onto the retina. Various cataract risk factors have been identified, including drug treatments, age, smoking and diabetes. However, the molecular events responsible for these different forms of cataract are ill-defined, and the advent of modern cataract surgery in the 1960s virtually eliminated access to human lenses for research. Here, we demonstrate large-scale production of light-focusing human micro-lenses from spheroidal masses of human lens epithelial cells purified from differentiating pluripotent stem cells. The purified lens cells and micro-lenses display similar morphology, cellular arrangement, mRNA expression and protein expression to human lens cells and lenses. Exposing the micro-lenses to the emergent cystic fibrosis drug Vx-770 reduces micro-lens transparency and focusing ability. These human micro-lenses provide a powerful and large-scale platform for defining molecular disease mechanisms caused by cataract risk factors, for anti-cataract drug screening and for clinically relevant toxicity assays.

Assuntos

Aminofenóis/efeitos adversos , Catarata/induzido quimicamente , Catarata/metabolismo , Cristalino/metabolismo , Modelos Biológicos , Células-Tronco Pluripotentes/metabolismo , Quinolonas/efeitos adversos , Aminofenóis/farmacologia , Catarata/patologia , Humanos , Cristalino/patologia , Células-Tronco Pluripotentes/patologia , Quinolonas/farmacologia

7.

dv-trio: a family-based variant calling pipeline using DeepVariant.

Ip, Eddie K K; Hadinata, Clinton; Ho, Joshua W K; Giannoulatou, Eleni.

Bioinformatics ; 36(11): 3549-3551, 2020 06 01.

Artigo em Inglês | MEDLINE | ID: mdl-32315409

RESUMO

MOTIVATION: In 2018, Google published an innovative variant caller, DeepVariant, which converts pileups of sequence reads into images and uses a deep neural network to identify single-nucleotide variants and small insertion/deletions from next-generation sequencing data. This approach outperforms existing state-of-the-art tools. However, DeepVariant was designed to call variants within a single sample. In disease sequencing studies, the ability to examine a family trio (father-mother-affected child) provides greater power for disease mutation discovery. RESULTS: To further improve DeepVariant's variant calling accuracy in family-based sequencing studies, we have developed a family-based variant calling pipeline, dv-trio, which incorporates the trio information from the Mendelian genetic model into variant calling based on DeepVariant. AVAILABILITY AND IMPLEMENTATION: dv-trio is available via an open source BSD3 license at GitHub (https://github.com/VCCRI/dv-trio/). CONTACT: e.giannoulatou@victorchang.edu.au. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Sequenciamento de Nucleotídeos em Larga Escala , Mutação INDEL , Criança , Humanos , Mutação , Redes Neurais de Computação , Software

8.

PARC: ultrafast and accurate clustering of phenotypic data of millions of single cells.

Stassen, Shobana V; Siu, Dickson M D; Lee, Kelvin C M; Ho, Joshua W K; So, Hayden K H; Tsia, Kevin K.

Bioinformatics ; 36(9): 2778-2786, 2020 05 01.

Artigo em Inglês | MEDLINE | ID: mdl-31971583

RESUMO

MOTIVATION: New single-cell technologies continue to fuel the explosive growth in the scale of heterogeneous single-cell data. However, existing computational methods are inadequately scalable to large datasets and therefore cannot uncover the complex cellular heterogeneity. RESULTS: We introduce a highly scalable graph-based clustering algorithm PARC-Phenotyping by Accelerated Refined Community-partitioning-for large-scale, high-dimensional single-cell data (>1 million cells). Using large single-cell flow and mass cytometry, RNA-seq and imaging-based biophysical data, we demonstrate that PARC consistently outperforms state-of-the-art clustering algorithms without subsampling of cells, including Phenograph, FlowSOM and Flock, in terms of both speed and ability to robustly detect rare cell populations. For example, PARC can cluster a single-cell dataset of 1.1 million cells within 13 min, compared with >2 h for the next fastest graph-clustering algorithm. Our work presents a scalable algorithm to cope with increasingly large-scale single-cell analysis. AVAILABILITY AND IMPLEMENTATION: https://github.com/ShobiStassen/PARC. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Algoritmos , Análise de Célula Única , Análise por Conglomerados , RNA-Seq , Software , Sequenciamento do Exoma

9.

The short isoform of the CEACAM1 receptor in intestinal T cells regulates mucosal immunity and homeostasis via Tfh cell induction.

Chen, Lanfen; Chen, Zhangguo; Baker, Kristi; Halvorsen, Elizabeth M; da Cunha, Andre Pires; Flak, Magdalena B; Gerber, Georg; Huang, Yu-Hwa; Hosomi, Shuhei; Arthur, Janelle C; Dery, Ken J; Nagaishi, Takashi; Beauchemin, Nicole; Holmes, Kathryn V; Ho, Joshua W K; Shively, John E; Jobin, Christian; Onderdonk, Andrew B; Bry, Lynn; Weiner, Howard L; Higgins, Darren E; Blumberg, Richard S.

Immunity ; 37(5): 930-46, 2012 Nov 16.

Artigo em Inglês | MEDLINE | ID: mdl-23123061

RESUMO

Carcinoembryonic antigen cell adhesion molecule like I (CEACAM1) is expressed on activated T cells and signals through either a long (L) cytoplasmic tail containing immune receptor tyrosine based inhibitory motifs, which provide inhibitory function, or a short (S) cytoplasmic tail with an unknown role. Previous studies on peripheral T cells show that CEACAM1-L isoforms predominate with little to no detectable CEACAM1-S isoforms in mouse and human. We show here that this was not the case in tissue resident T cells of intestines and gut associated lymphoid tissues, which demonstrated predominant expression of CEACAM1-S isoforms relative to CEACAM1-L isoforms in human and mouse. This tissue resident predominance of CEACAM1-S expression was determined by the intestinal environment where it served a stimulatory function leading to the regulation of T cell subsets associated with the generation of secretory IgA immunity, the regulation of mucosal commensalism, and defense of the barrier against enteropathogens.

Assuntos

Antígeno Carcinoembrionário/imunologia , Imunidade nas Mucosas/imunologia , Intestinos/imunologia , Linfócitos T/imunologia , Motivos de Aminoácidos/genética , Motivos de Aminoácidos/imunologia , Animais , Antígeno Carcinoembrionário/genética , Antígeno Carcinoembrionário/metabolismo , Citoplasma/genética , Citoplasma/imunologia , Citoplasma/metabolismo , Homeostase , Imunidade nas Mucosas/genética , Imunoglobulina A/genética , Imunoglobulina A/imunologia , Imunoglobulina A/metabolismo , Mucosa Intestinal/metabolismo , Listeria monocytogenes/imunologia , Listeriose/imunologia , Ativação Linfocitária , Metagenoma/imunologia , Camundongos , Camundongos Endogâmicos C57BL , Fatores de Transcrição NFATC/genética , Fatores de Transcrição NFATC/metabolismo , Isoformas de Proteínas , Receptores Imunológicos/genética , Receptores Imunológicos/imunologia , Receptores Imunológicos/metabolismo , Linfócitos T/metabolismo , Tirosina/genética , Tirosina/imunologia , Tirosina/metabolismo

10.

Ularcirc: visualization and enhanced analysis of circular RNAs via back and canonical forward splicing.

Humphreys, David T; Fossat, Nicolas; Demuth, Madeleine; Tam, Patrick P L; Ho, Joshua W K.

Nucleic Acids Res ; 47(20): e123, 2019 11 18.

Artigo em Inglês | MEDLINE | ID: mdl-31435647

RESUMO

Circular RNAs (circRNA) are a unique class of transcripts that can only be identified from sequence alignments spanning discordant junctions, commonly referred to as backsplice junctions (BSJ). Canonical splicing is also linked with circRNA biogenesis either from the parental transcript or internal to the circRNA, and is not fully utilized in circRNA software. Here we present Ularcirc, a software tool that integrates the visualization of both BSJ and forward splicing junctions and provides downstream analysis of selected circRNA candidates. Ularcirc utilizes the output of CIRI, circExplorer, or raw chimeric output of the STAR aligner and assembles BSJ count table to allow multi-sample analysis. We used Ularcirc to identify and characterize circRNA from public and in-house generated data sets and demonstrate how it can be used to (i) discover novel splicing patterns of parental transcripts, (ii) detect internal splicing patterns of circRNA, and (iii) reveal the complexity of BSJ formation. Furthermore, we identify circRNA that have potential open reading frames longer than their linear sequence. Finally, we detected and validated the presence of a novel class of circRNA generated from ApoA4 transcripts whose BSJ derive from multiple non-canonical splicing sites within coding exons. Ularcirc is accessed via https://github.com/VCCRI/Ularcirc.

Assuntos

Sítios de Splice de RNA , RNA Circular/genética , Software , Humanos , Splicing de RNA , RNA Circular/química , RNA Circular/metabolismo , Análise de Sequência de RNA/métodos

11.

NAD Deficiency, Congenital Malformations, and Niacin Supplementation.

Shi, Hongjun; Enriquez, Annabelle; Rapadas, Melissa; Martin, Ella M M A; Wang, Roni; Moreau, Julie; Lim, Chai K; Szot, Justin O; Ip, Eddie; Hughes, James N; Sugimoto, Kotaro; Humphreys, David T; McInerney-Leo, Aideen M; Leo, Paul J; Maghzal, Ghassan J; Halliday, Jake; Smith, Janine; Colley, Alison; Mark, Paul R; Collins, Felicity; Sillence, David O; Winlaw, David S; Ho, Joshua W K; Guillemin, Gilles J; Brown, Matthew A; Kikuchi, Kazu; Thomas, Paul Q; Stocker, Roland; Giannoulatou, Eleni; Chapman, Gavin; Duncan, Emma L; Sparrow, Duncan B; Dunwoodie, Sally L.

N Engl J Med ; 377(6): 544-552, 2017 08 10.

Artigo em Inglês | MEDLINE | ID: mdl-28792876

RESUMO

BACKGROUND: Congenital malformations can be manifested as combinations of phenotypes that co-occur more often than expected by chance. In many such cases, it has proved difficult to identify a genetic cause. We sought the genetic cause of cardiac, vertebral, and renal defects, among others, in unrelated patients. METHODS: We used genomic sequencing to identify potentially pathogenic gene variants in families in which a person had multiple congenital malformations. We tested the function of the variant by using assays of in vitro enzyme activity and by quantifying metabolites in patient plasma. We engineered mouse models with similar variants using the CRISPR (clustered regularly interspaced short palindromic repeats)-Cas9 system. RESULTS: Variants were identified in two genes that encode enzymes of the kynurenine pathway, 3-hydroxyanthranilic acid 3,4-dioxygenase (HAAO) and kynureninase (KYNU). Three patients carried homozygous variants predicting loss-of-function changes in the HAAO or KYNU proteins (HAAO p.D162*, HAAO p.W186*, or KYNU p.V57Efs*21). Another patient carried heterozygous KYNU variants (p.Y156* and p.F349Kfs*4). The mutant enzymes had greatly reduced activity in vitro. Nicotinamide adenine dinucleotide (NAD) is synthesized de novo from tryptophan through the kynurenine pathway. The patients had reduced levels of circulating NAD. Defects similar to those in the patients developed in the embryos of Haao-null or Kynu-null mice owing to NAD deficiency. In null mice, the prevention of NAD deficiency during gestation averted defects. CONCLUSIONS: Disruption of NAD synthesis caused a deficiency of NAD and congenital malformations in humans and mice. Niacin supplementation during gestation prevented the malformations in mice. (Funded by the National Health and Medical Research Council of Australia and others.).

Assuntos

3-Hidroxiantranilato 3,4-Dioxigenase/genética , Anormalidades Congênitas/genética , Suplementos Nutricionais , Hidrolases/genética , NAD/deficiência , Niacina/uso terapêutico , 3-Hidroxiantranilato 3,4-Dioxigenase/metabolismo , Canal Anal/anormalidades , Animais , Anormalidades Congênitas/prevenção & controle , Modelos Animais de Doenças , Esôfago/anormalidades , Feminino , Cardiopatias Congênitas/genética , Cardiopatias Congênitas/prevenção & controle , Humanos , Hidrolases/metabolismo , Rim/anormalidades , Deformidades Congênitas dos Membros/genética , Deformidades Congênitas dos Membros/prevenção & controle , Masculino , Camundongos , Camundongos Knockout , Mutação , NAD/biossíntese , NAD/genética , Análise de Sequência de DNA , Coluna Vertebral/anormalidades , Traqueia/anormalidades

12.

Multi-omic profiling reveals associations between the gut mucosal microbiome, the metabolome, and host DNA methylation associated gene expression in patients with colorectal cancer.

Wang, Qing; Ye, Jianzhong; Fang, Daiqiong; Lv, Longxian; Wu, Wenrui; Shi, Ding; Li, Yating; Yang, Liya; Bian, Xiaoyuan; Wu, Jingjing; Jiang, Xianwan; Wang, Kaicen; Wang, Qiangqiang; Hodson, Mark P; Thibaut, Loïc M; Ho, Joshua W K; Giannoulatou, Eleni; Li, Lanjuan.

BMC Microbiol ; 20(Suppl 1): 83, 2020 04 23.

Artigo em Inglês | MEDLINE | ID: mdl-32321427

RESUMO

BACKGROUND: The human gut microbiome plays a critical role in the carcinogenesis of colorectal cancer (CRC). However, a comprehensive analysis of the interaction between the host and microbiome is still lacking. RESULTS: We found correlations between the change in abundance of microbial taxa, butyrate-related colonic metabolites, and methylation-associated host gene expression in colonic tumour mucosa tissues compared with the adjacent normal mucosa tissues. The increase of genus Fusobacterium abundance was correlated with a decrease in the level of 4-hydroxybutyric acid (4-HB) and expression of immune-related peptidase inhibitor 16 (PI16), Fc Receptor Like A (FCRLA) and Lymphocyte Specific Protein 1 (LSP1). The decrease in the abundance of another potentially 4-HB-associated genus, Prevotella 2, was also found to be correlated with the down-regulated expression of metallothionein 1 M (MT1M). Additionally, the increase of glutamic acid-related family Halomonadaceae was correlated with the decreased expression of reelin (RELN). The decreased abundance of genus Paeniclostridium and genus Enterococcus were correlated with increased lactic acid level, and were also linked to the expression change of Phospholipase C Beta 1 (PLCB1) and Immunoglobulin Superfamily Member 9 (IGSF9) respectively. Interestingly, 4-HB, glutamic acid and lactic acid are all butyrate precursors, which may modify gene expression by epigenetic regulation such as DNA methylation. CONCLUSIONS: Our study identified associations between previously reported CRC-related microbial taxa, butyrate-related metabolites and DNA methylation-associated gene expression in tumour and normal colonic mucosa tissues from CRC patients, which uncovered a possible mechanism of the role of microbiome in the carcinogenesis of CRC. In addition, these findings offer insight into potential new biomarkers, therapeutic and/or prevention strategies for CRC.

Assuntos

Neoplasias Colorretais/microbiologia , Microbioma Gastrointestinal/fisiologia , Mucosa Intestinal/microbiologia , Bactérias/classificação , Bactérias/genética , Bactérias/isolamento & purificação , Bactérias/metabolismo , Butiratos/metabolismo , Colo/metabolismo , Colo/microbiologia , Colo/patologia , Neoplasias Colorretais/genética , Neoplasias Colorretais/metabolismo , Neoplasias Colorretais/patologia , Metilação de DNA , Epigênese Genética , Regulação Neoplásica da Expressão Gênica , Humanos , Mucosa Intestinal/metabolismo , Metaboloma , Proteína Reelina , Transcriptoma

13.

Comparative analysis of metazoan chromatin organization.

Ho, Joshua W K; Jung, Youngsook L; Liu, Tao; Alver, Burak H; Lee, Soohyun; Ikegami, Kohta; Sohn, Kyung-Ah; Minoda, Aki; Tolstorukov, Michael Y; Appert, Alex; Parker, Stephen C J; Gu, Tingting; Kundaje, Anshul; Riddle, Nicole C; Bishop, Eric; Egelhofer, Thea A; Hu, Sheng'en Shawn; Alekseyenko, Artyom A; Rechtsteiner, Andreas; Asker, Dalal; Belsky, Jason A; Bowman, Sarah K; Chen, Q Brent; Chen, Ron A-J; Day, Daniel S; Dong, Yan; Dose, Andrea C; Duan, Xikun; Epstein, Charles B; Ercan, Sevinc; Feingold, Elise A; Ferrari, Francesco; Garrigues, Jacob M; Gehlenborg, Nils; Good, Peter J; Haseley, Psalm; He, Daniel; Herrmann, Moritz; Hoffman, Michael M; Jeffers, Tess E; Kharchenko, Peter V; Kolasinska-Zwierz, Paulina; Kotwaliwale, Chitra V; Kumar, Nischay; Langley, Sasha A; Larschan, Erica N; Latorre, Isabel; Libbrecht, Maxwell W; Lin, Xueqiu; Park, Richard.

Nature ; 512(7515): 449-52, 2014 Aug 28.

Artigo em Inglês | MEDLINE | ID: mdl-25164756

RESUMO

Genome function is dynamically regulated in part by chromatin, which consists of the histones, non-histone proteins and RNA molecules that package DNA. Studies in Caenorhabditis elegans and Drosophila melanogaster have contributed substantially to our understanding of molecular mechanisms of genome function in humans, and have revealed conservation of chromatin components and mechanisms. Nevertheless, the three organisms have markedly different genome sizes, chromosome architecture and gene organization. On human and fly chromosomes, for example, pericentric heterochromatin flanks single centromeres, whereas worm chromosomes have dispersed heterochromatin-like regions enriched in the distal chromosomal 'arms', and centromeres distributed along their lengths. To systematically investigate chromatin organization and associated gene regulation across species, we generated and analysed a large collection of genome-wide chromatin data sets from cell lines and developmental stages in worm, fly and human. Here we present over 800 new data sets from our ENCODE and modENCODE consortia, bringing the total to over 1,400. Comparison of combinatorial patterns of histone modifications, nuclear lamina-associated domains, organization of large-scale topological domains, chromatin environment at promoters and enhancers, nucleosome positioning, and DNA replication patterns reveals many conserved features of chromatin organization among the three organisms. We also find notable differences in the composition and locations of repressive chromatin. These data sets and analyses provide a rich resource for comparative and species-specific investigations of chromatin composition, organization and function.

Assuntos

Caenorhabditis elegans/citologia , Caenorhabditis elegans/genética , Cromatina/genética , Cromatina/metabolismo , Drosophila melanogaster/citologia , Drosophila melanogaster/genética , Animais , Linhagem Celular , Centrômero/genética , Centrômero/metabolismo , Cromatina/química , Montagem e Desmontagem da Cromatina/genética , Replicação do DNA/genética , Elementos Facilitadores Genéticos/genética , Epigênese Genética , Heterocromatina/química , Heterocromatina/genética , Heterocromatina/metabolismo , Histonas/química , Histonas/metabolismo , Humanos , Anotação de Sequência Molecular , Lâmina Nuclear/metabolismo , Nucleossomos/química , Nucleossomos/genética , Nucleossomos/metabolismo , Regiões Promotoras Genéticas/genética , Especificidade da Espécie

14.

iSyTE 2.0: a database for expression-based gene discovery in the eye.

Kakrana, Atul; Yang, Andrian; Anand, Deepti; Djordjevic, Djordje; Ramachandruni, Deepti; Singh, Abhyudai; Huang, Hongzhan; Ho, Joshua W K; Lachke, Salil A.

Nucleic Acids Res ; 46(D1): D875-D885, 2018 01 04.

Artigo em Inglês | MEDLINE | ID: mdl-29036527

RESUMO

Although successful in identifying new cataract-linked genes, the previous version of the database iSyTE (integrated Systems Tool for Eye gene discovery) was based on expression information on just three mouse lens stages and was functionally limited to visualization by only UCSC-Genome Browser tracks. To increase its efficacy, here we provide an enhanced iSyTE version 2.0 (URL: http://research.bioinformatics.udel.edu/iSyTE) based on well-curated, comprehensive genome-level lens expression data as a one-stop portal for the effective visualization and analysis of candidate genes in lens development and disease. iSyTE 2.0 includes all publicly available lens Affymetrix and Illumina microarray datasets representing a broad range of embryonic and postnatal stages from wild-type and specific gene-perturbation mouse mutants with eye defects. Further, we developed a new user-friendly web interface for direct access and cogent visualization of the curated expression data, which supports convenient searches and a range of downstream analyses. The utility of these new iSyTE 2.0 features is illustrated through examples of established genes associated with lens development and pathobiology, which serve as tutorials for its application by the end-user. iSyTE 2.0 will facilitate the prioritization of eye development and disease-linked candidate genes in studies involving transcriptomics or next-generation sequencing data, linkage analysis and GWAS approaches.

Assuntos

Catarata/genética , Bases de Dados Genéticas , Proteínas do Olho/genética , Expressão Gênica , Estudos de Associação Genética/métodos , Animais , Catarata/embriologia , Catarata/metabolismo , Conjuntos de Dados como Assunto , Modelos Animais de Doenças , Proteínas do Olho/biossíntese , Previsões , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Estudo de Associação Genômica Ampla , Humanos , Cristalino/embriologia , Cristalino/crescimento & desenvolvimento , Cristalino/metabolismo , Camundongos , Camundongos Mutantes , Análise de Sequência com Séries de Oligonucleotídeos , Interface Usuário-Computador

15.

Identification of satellite cells from anole lizard skeletal muscle and demonstration of expanded musculoskeletal potential.

Palade, Joanna; Djordjevic, Djordje; Hutchins, Elizabeth D; George, Rajani M; Cornelius, John A; Rawls, Alan; Ho, Joshua W K; Kusumi, Kenro; Wilson-Rawls, Jeanne.

Dev Biol ; 433(2): 344-356, 2018 01 15.

Artigo em Inglês | MEDLINE | ID: mdl-29291980

RESUMO

The lizards are evolutionarily the closest vertebrates to humans that demonstrate the ability to regenerate entire appendages containing cartilage, muscle, skin, and nervous tissue. We previously isolated PAX7-positive cells from muscle of the green anole lizard, Anolis carolinensis, that can differentiate into multinucleated myotubes and express the muscle structural protein, myosin heavy chain. Studying gene expression in these satellite/progenitor cell populations from A. carolinensis can provide insight into the mechanisms regulating tissue regeneration. We generated a transcriptome from proliferating lizard myoprogenitor cells and compared them to transcriptomes from the mouse and human tissues from the ENCODE project using XGSA, a statistical method for cross-species gene set analysis. These analyses determined that the lizard progenitor cell transcriptome was most similar to mammalian satellite cells. Further examination of specific GO categories of genes demonstrated that among genes with the highest level of expression in lizard satellite cells were an increased number of genetic regulators of chondrogenesis, as compared to mouse satellite cells. In micromass culture, lizard PAX7-positive cells formed Alcian blue and collagen 2a1 positive nodules, without the addition of exogenous morphogens, unlike their mouse counterparts. Subsequent quantitative RT-PCR confirmed up-regulation of expression of chondrogenic regulatory genes in lizard cells, including bmp2, sox9, runx2, and cartilage specific structural genes, aggrecan and collagen 2a1. Taken together, these data suggest that tail regeneration in lizards involves significant alterations in gene regulation with expanded musculoskeletal potency.

Assuntos

Lagartos/fisiologia , Músculo Esquelético/citologia , Células Satélites de Músculo Esquelético/fisiologia , Animais , Proteína Morfogenética Óssea 2/genética , Proteína Morfogenética Óssea 2/fisiologia , Linhagem da Célula , Células Cultivadas , Condrogênese/genética , Regulação da Expressão Gênica , Ontologia Genética , Humanos , Peptídeos e Proteínas de Sinalização Intracelular/genética , Peptídeos e Proteínas de Sinalização Intracelular/fisiologia , Camundongos , Desenvolvimento Muscular/genética , Proteínas Musculares/genética , Proteínas Musculares/fisiologia , Músculo Esquelético/fisiologia , Mioblastos/citologia , Fator de Transcrição PAX7/análise , Transdução de Sinais , Especificidade da Espécie , Transcriptoma

16.

Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco.

Yang, Andrian; Kishore, Abhinav; Phipps, Benjamin; Ho, Joshua W K.

BMC Genomics ; 20(Suppl 10): 927, 2019 Dec 30.

Artigo em Inglês | MEDLINE | ID: mdl-31888474

RESUMO

BACKGROUND: Read alignment and transcript assembly are the core of RNA-seq analysis for transcript isoform discovery. Nonetheless, current tools are not designed to be scalable for analysis of full-length bulk or single cell RNA-seq (scRNA-seq) data. The previous version of our cloud-based tool Falco only focuses on RNA-seq read counting, but does not allow for more flexible steps such as alignment and read assembly. RESULTS: The Falco framework can harness the parallel and distributed computing environment in modern cloud platforms to accelerate read alignment and transcript assembly of full-length bulk RNA-seq and scRNA-seq data. There are two new modes in Falco: alignment-only and transcript assembly. In the alignment-only mode, Falco can speed up the alignment process by 2.5-16.4x based on two public scRNA-seq datasets when compared to alignment on a highly optimised standalone computer. Furthermore, it also provides a 10x average speed-up compared to alignment using published cloud-enabled tool for read alignment, Rail-RNA. In the transcript assembly mode, Falco can speed up the transcript assembly process by 1.7-16.5x compared to performing transcript assembly on a highly optimised computer. CONCLUSION: Falco is a significantly updated open source big data processing framework that enables scalable and accelerated alignment and assembly of full-length scRNA-seq data on the cloud. The source code can be found at https://github.com/VCCRI/Falco.

Assuntos

Computação em Nuvem , RNA-Seq , Alinhamento de Sequência/métodos , Análise de Célula Única , Éxons/genética , Humanos , Íntrons/genética

17.

Crim1 regulates integrin signaling in murine lens development.

Zhang, Ying; Fan, Jieqing; Ho, Joshua W K; Hu, Tommy; Kneeland, Stephen C; Fan, Xueping; Xi, Qiongchao; Sellarole, Michael A; de Vries, Wilhelmine N; Lu, Weining; Lachke, Salil A; Lang, Richard A; John, Simon W M; Maas, Richard L.

Development ; 143(2): 356-66, 2016 Jan 15.

Artigo em Inglês | MEDLINE | ID: mdl-26681494

RESUMO

The developing lens is a powerful system for investigating the molecular basis of inductive tissue interactions and for studying cataract, the leading cause of blindness. The formation of tightly controlled cell-cell adhesions and cell-matrix junctions between lens epithelial (LE) cells, between lens fiber (LF) cells, and between these two cell populations enables the vertebrate lens to adopt a highly ordered structure and acquire optical transparency. Adhesion molecules are thought to maintain this ordered structure, but little is known about their identity or interactions. Cysteine-rich motor neuron 1 (Crim1), a type I transmembrane protein, is strongly expressed in the developing lens and its mutation causes ocular disease in both mice and humans. How Crim1 regulates lens morphogenesis is not understood. We identified a novel ENU-induced hypomorphic allele of Crim1, Crim1(glcr11), which in the homozygous state causes cataract and microphthalmia. Using this and two other mutant alleles, Crim1(null) and Crim1(cko), we show that the lens defects in Crim1 mouse mutants originate from defective LE cell polarity, proliferation and cell adhesion. Crim1 adhesive function is likely to be required for interactions both between LE cells and between LE and LF cells. We show that Crim1 acts in LE cells, where it colocalizes with and regulates the levels of active ß1 integrin and of phosphorylated FAK and ERK. The RGD and transmembrane motifs of Crim1 are required for regulating FAK phosphorylation. These results identify an important function for Crim1 in the regulation of integrin- and FAK-mediated LE cell adhesion during lens development.

Assuntos

Receptores de Proteínas Morfogenéticas Ósseas/metabolismo , Cristalino/citologia , Animais , Receptores de Proteínas Morfogenéticas Ósseas/genética , Linhagem Celular , Regulação da Expressão Gênica no Desenvolvimento , Imuno-Histoquímica , Marcação In Situ das Extremidades Cortadas , Cristalino/metabolismo , Camundongos , Camundongos Endogâmicos C57BL , Organogênese/genética , Organogênese/fisiologia , Fosforilação , Transdução de Sinais/fisiologia

18.

Identification of clinically actionable variants from genome sequencing of families with congenital heart disease.

Alankarage, Dimuthu; Ip, Eddie; Szot, Justin O; Munro, Jacob; Blue, Gillian M; Harrison, Katrina; Cuny, Hartmut; Enriquez, Annabelle; Troup, Michael; Humphreys, David T; Wilson, Meredith; Harvey, Richard P; Sholler, Gary F; Graham, Robert M; Ho, Joshua W K; Kirk, Edwin P; Pachter, Nicholas; Chapman, Gavin; Winlaw, David S; Giannoulatou, Eleni; Dunwoodie, Sally L.

Genet Med ; 21(5): 1111-1120, 2019 05.

Artigo em Inglês | MEDLINE | ID: mdl-30293987

RESUMO

PURPOSE: Congenital heart disease (CHD) affects up to 1% of live births. However, a genetic diagnosis is not made in most cases. The purpose of this study was to assess the outcomes of genome sequencing (GS) of a heterogeneous cohort of CHD patients. METHODS: Ninety-seven families with probands born with CHD requiring surgical correction were recruited for genome sequencing. At minimum, a proband-parents trio was sequenced per family. GS data were analyzed via a two-tiered method: application of a high-confidence gene screen (hcCHD), and comprehensive analysis. Identified variants were assessed for pathogenicity using the American College of Medical Genetics and Genomics-Association for Molecular Pathology (ACMG-AMP) guidelines. RESULTS: Clinically relevant genetic variants in known and emerging CHD genes were identified. The hcCHD screen identified a clinically actionable variant in 22% of families. Subsequent comprehensive analysis identified a clinically actionable variant in an additional 9% of families in genes with recent disease associations. Overall, this two-tiered approach provided a clinically relevant variant for 31% of families. CONCLUSIONS: Interrogating GS data using our two-tiered method allowed identification of variants with high clinical utility in a third of our heterogeneous cohort. However, association of emerging genes with CHD etiology, and development of novel technologies for variant assessment and interpretation, will increase diagnostic yield during future reassessment of our GS data.

Assuntos

Testes Genéticos/métodos , Cardiopatias Congênitas/diagnóstico , Cardiopatias Congênitas/genética , Sequência de Bases/genética , Mapeamento Cromossômico/métodos , Estudos de Coortes , Exoma/genética , Família , Feminino , Predisposição Genética para Doença/genética , Variação Genética/genética , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Masculino , Mutação/genética , Pais , Análise de Sequência de DNA/métodos , Sequenciamento Completo do Genoma/métodos

19.

PBrowse: a web-based platform for real-time collaborative exploration of genomic data.

Szot, Peter S; Yang, Andrian; Wang, Xin; Parsania, Chirag; Röhm, Uwe; Wong, Koon Ho; Ho, Joshua W K.

Nucleic Acids Res ; 45(9): e67, 2017 May 19.

Artigo em Inglês | MEDLINE | ID: mdl-28100700

RESUMO

Genome browsers are widely used for individually exploring various types of genomic data. A handful of genome browsers offer limited tools for collaboration among multiple users. Here, we describe PBrowse, an integrated real-time collaborative genome browser that enables multiple users to simultaneously view and access genomic data, thereby harnessing the wisdom of the crowd. PBrowse is based on the Dalliance genome browser and has a re-designed user and data management system with novel collaborative functionalities, including real-time collaborative view, track comment and an integrated group chat feature. Through the Distributed Annotation Server protocol, PBrowse can easily access a wide range of publicly available genomic data, such as the ENCODE data sets. We argue that PBrowse represents a paradigm shift from using a genome browser as a static data visualization tool to a platform that enables real-time human-human interaction and knowledge exchange in a collaborative setting. PBrowse is available at http://pbrowse.victorchang.edu.au, and its source code is available via an open source BSD 3 license at http://github.com/VCCRI/PBrowse.

Assuntos

Bases de Dados Genéticas , Internet , Navegador , Comportamento Cooperativo , Genoma Humano , Humanos , Armazenamento e Recuperação da Informação , Interface Usuário-Computador

20.

Discovery of cell-type specific DNA motif grammar in cis-regulatory elements using random Forest.

Wang, Xin; Lin, Peijie; Ho, Joshua W K.

BMC Genomics ; 19(Suppl 1): 929, 2018 01 19.

Artigo em Inglês | MEDLINE | ID: mdl-29363433

RESUMO

BACKGROUND: It has been observed that many transcription factors (TFs) can bind to different genomic loci depending on the cell type in which a TF is expressed in, even though the individual TF usually binds to the same core motif in different cell types. How a TF can bind to the genome in such a highly cell-type specific manner, is a critical research question. One hypothesis is that a TF requires co-binding of different TFs in different cell types. If this is the case, it may be possible to observe different combinations of TF motifs - a motif grammar - located at the TF binding sites in different cell types. In this study, we develop a bioinformatics method to systematically identify DNA motifs in TF binding sites across multiple cell types based on published ChIP-seq data, and address two questions: (1) can we build a machine learning classifier to predict cell-type specificity based on motif combinations alone, and (2) can we extract meaningful cell-type specific motif grammars from this classifier model. RESULTS: We present a Random Forest (RF) based approach to build a multi-class classifier to predict the cell-type specificity of a TF binding site given its motif content. We applied this RF classifier to two published ChIP-seq datasets of TF (TCF7L2 and MAX) across multiple cell types. Using cross-validation, we show that motif combinations alone are indeed predictive of cell types. Furthermore, we present a rule mining approach to extract the most discriminatory rules in the RF classifier, thus allowing us to discover the underlying cell-type specific motif grammar. CONCLUSIONS: Our bioinformatics analysis supports the hypothesis that combinatorial TF motif patterns are cell-type specific.

Assuntos

Algoritmos , Biologia Computacional/métodos , Neoplasias/genética , Motivos de Nucleotídeos , Elementos Reguladores de Transcrição , Fatores de Transcrição de Zíper de Leucina e Hélice-Alça-Hélix Básicos/classificação , Fatores de Transcrição de Zíper de Leucina e Hélice-Alça-Hélix Básicos/genética , Regulação Neoplásica da Expressão Gênica , Humanos , Neoplasias/classificação , Software , Proteína 2 Semelhante ao Fator 7 de Transcrição/classificação , Proteína 2 Semelhante ao Fator 7 de Transcrição/genética , Células Tumorais Cultivadas

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA