Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
1.
Nucleic Acids Res ; 47(W1): W175-W182, 2019 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-31127311

RESUMO

The discovery and development of DNA-editing nucleases (Zinc Finger Nucleases, TALENs, CRISPR/Cas systems) has given scientists the ability to precisely engineer or edit genomes as never before. Several different platforms, protocols and vectors for precision genome editing are now available, leading to the development of supporting web-based software. Here we present the Gene Sculpt Suite (GSS), which comprises three tools: (i) GTagHD, which automatically designs and generates oligonucleotides for use with the GeneWeld knock-in protocol; (ii) MEDJED, a machine learning method, which predicts the extent to which a double-stranded DNA break site will utilize the microhomology-mediated repair pathway; and (iii) MENTHU, a tool for identifying genomic locations likely to give rise to a single predominant microhomology-mediated end joining allele (PreMA) repair outcome. All tools in the GSS are freely available for download under the GPL v3.0 license and can be run locally on Windows, Mac and Linux systems capable of running R and/or Docker. The GSS is also freely available online at www.genesculpt.org.


Assuntos
Bases de Dados Genéticas , Edição de Genes , Engenharia Genética/métodos , Software , Animais , Sistemas CRISPR-Cas/genética , Quebras de DNA de Cadeia Dupla , Humanos , Nucleases dos Efetores Semelhantes a Ativadores de Transcrição/genética , Nucleases de Dedos de Zinco/genética
2.
PLoS Genet ; 14(9): e1007652, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-30208061

RESUMO

One key problem in precision genome editing is the unpredictable plurality of sequence outcomes at the site of targeted DNA double stranded breaks (DSBs). This is due to the typical activation of the versatile Non-homologous End Joining (NHEJ) pathway. Such unpredictability limits the utility of somatic gene editing for applications including gene therapy and functional genomics. For germline editing work, the accurate reproduction of the identical alleles using NHEJ is a labor intensive process. In this study, we propose Microhomology-mediated End Joining (MMEJ) as a viable solution for improving somatic sequence homogeneity in vivo, capable of generating a single predictable allele at high rates (56% ~ 86% of the entire mutant allele pool). Using a combined dataset from zebrafish (Danio rerio) in vivo and human HeLa cell in vitro, we identified specific contextual sequence determinants surrounding genomic DSBs for robust MMEJ pathway activation. We then applied our observation to prospectively design MMEJ-inducing sgRNAs against a variety of proof-of-principle genes and demonstrated high levels of mutant allele homogeneity. MMEJ-based DNA repair at these target loci successfully generated F0 mutant zebrafish embryos and larvae that faithfully recapitulated previously reported, recessive, loss-of-function phenotypes. We also tested the generalizability of our approach in cultured human cells. Finally, we provide a novel algorithm, MENTHU (http://genesculpt.org/menthu/), for improved and facile prediction of candidate MMEJ loci. We believe that this MMEJ-centric approach will have a broader impact on genome engineering and its applications. For example, whereas somatic mosaicism hinders efficient recreation of knockout mutant allele at base pair resolution via the standard NHEJ-based approach, we demonstrate that F0 founders transmitted the identical MMEJ allele of interest at high rates. Most importantly, the ability to directly dictate the reading frame of an endogenous target will have important implications for gene therapy applications in human genetic diseases.


Assuntos
Quebras de DNA de Cadeia Dupla , Reparo do DNA por Junção de Extremidades/genética , Edição de Genes/métodos , Modelos Genéticos , Algoritmos , Alelos , Animais , Estudos de Viabilidade , Feminino , Doenças Genéticas Inatas/genética , Doenças Genéticas Inatas/terapia , Terapia Genética/métodos , Células HeLa , Humanos , Masculino , Mutagênese Sítio-Dirigida , RNA Guia de Cinetoplastídeos/genética , RNA Guia de Cinetoplastídeos/metabolismo , Peixe-Zebra
3.
bioRxiv ; 2022 Nov 23.
Artigo em Inglês | MEDLINE | ID: mdl-36451881

RESUMO

We seek to transform how new and emergent variants of pandemic-causing viruses, specifically SARS-CoV-2, are identified and classified. By adapting large language models (LLMs) for genomic data, we build genome-scale language models (GenSLMs) which can learn the evolutionary landscape of SARS-CoV-2 genomes. By pre-training on over 110 million prokaryotic gene sequences and fine-tuning a SARS-CoV-2-specific model on 1.5 million genomes, we show that GenSLMs can accurately and rapidly identify variants of concern. Thus, to our knowledge, GenSLMs represents one of the first whole genome scale foundation models which can generalize to other prediction tasks. We demonstrate scaling of GenSLMs on GPU-based supercomputers and AI-hardware accelerators utilizing 1.63 Zettaflops in training runs with a sustained performance of 121 PFLOPS in mixed precision and peak of 850 PFLOPS. We present initial scientific insights from examining GenSLMs in tracking evolutionary dynamics of SARS-CoV-2, paving the path to realizing this on large biological data.

4.
Bio Protoc ; 11(14): e4100, 2021 Jul 20.
Artigo em Inglês | MEDLINE | ID: mdl-34395736

RESUMO

Efficient precision genome engineering requires high frequency and specificity of integration at the genomic target site. Multiple design strategies for zebrafish gene targeting have previously been reported with widely varying frequencies for germline recovery of integration alleles. The GeneWeld protocol and pGTag (plasmids for Gene Tagging) vector series provide a set of resources to streamline precision gene targeting in zebrafish. Our approach uses short homology of 24-48 bp to drive targeted integration of DNA reporter cassettes by homology-mediated end joining (HMEJ) at a CRISPR/Cas induced DNA double-strand break. The pGTag vectors contain reporters flanked by a universal CRISPR sgRNA sequence to liberate the targeting cassette in vivo and expose homology arms for homology-driven integration. Germline transmission rates for precision-targeted integration alleles range 22-100%. Our system provides a streamlined, straightforward, and cost-effective approach for high-efficiency gene targeting applications in zebrafish. Graphic abstract: GeneWeld method for CRISPR/Cas9 targeted integration.

5.
Elife ; 92020 05 15.
Artigo em Inglês | MEDLINE | ID: mdl-32412410

RESUMO

Efficient precision genome engineering requires high frequency and specificity of integration at the genomic target site. Here, we describe a set of resources to streamline reporter gene knock-ins in zebrafish and demonstrate the broader utility of the method in mammalian cells. Our approach uses short homology of 24-48 bp to drive targeted integration of DNA reporter cassettes by homology-mediated end joining (HMEJ) at high frequency at a double strand break in the targeted gene. Our vector series, pGTag (plasmids for Gene Tagging), contains reporters flanked by a universal CRISPR sgRNA sequence which enables in vivo liberation of the homology arms. We observed high rates of germline transmission (22-100%) for targeted knock-ins at eight zebrafish loci and efficient integration at safe harbor loci in porcine and human cells. Our system provides a straightforward and cost-effective approach for high efficiency gene targeting applications in CRISPR and TALEN compatible systems.


Assuntos
Proteínas Associadas a CRISPR/genética , Sistemas CRISPR-Cas , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Técnicas de Introdução de Genes , Genes Reporter , Proteínas de Fluorescência Verde/genética , Nucleases dos Efetores Semelhantes a Ativadores de Transcrição/genética , Peixe-Zebra/genética , Animais , Animais Geneticamente Modificados , Proteínas Associadas a CRISPR/metabolismo , Fibroblastos/metabolismo , Regulação da Expressão Gênica , Proteínas de Fluorescência Verde/metabolismo , Humanos , Células K562 , Leucemia Mielogênica Crônica BCR-ABL Positiva/genética , Leucemia Mielogênica Crônica BCR-ABL Positiva/metabolismo , RNA Guia de Cinetoplastídeos/genética , RNA Guia de Cinetoplastídeos/metabolismo , Reparo de DNA por Recombinação , Homologia de Sequência do Ácido Nucleico , Sus scrofa , Nucleases dos Efetores Semelhantes a Ativadores de Transcrição/metabolismo
6.
Methods Mol Biol ; 1543: 169-185, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28349426

RESUMO

Experimental methods for identifying protein(s) bound by a specific promoter-associated RNA (paRNA) of interest can be expensive, difficult, and time-consuming. This chapter describes a general computational framework for identifying potential binding partners in RNA-protein complexes or RNA-protein interaction networks. Protocols for using three web-based tools to predict RNA-protein interaction partners are outlined. Also, tables listing additional webservers and software tools for predicting RNA-protein interactions, as well as databases that contain valuable information about known RNA-protein complexes and recognition sites for RNA-binding proteins, are provided. Although only one of the tools described, lncPro, was designed expressly to identify proteins that bind long noncoding RNAs (including paRNAs), all three approaches can be applied to predict potential binding partners for both coding and noncoding RNAs (ncRNAs).


Assuntos
Biologia Computacional/métodos , Proteínas de Ligação a RNA/química , Proteínas de Ligação a RNA/metabolismo , RNA/química , RNA/metabolismo , Software , Sítios de Ligação , Simulação por Computador , Bases de Dados Genéticas , Ligação Proteica , RNA/genética , Ferramenta de Busca , Máquina de Vetores de Suporte , Navegador
7.
Pac Symp Biocomput ; 21: 445-455, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26776208

RESUMO

Efforts to predict interfacial residues in protein-RNA complexes have largely focused on predicting RNA-binding residues in proteins. Computational methods for predicting protein-binding residues in RNA sequences, however, are a problem that has received relatively little attention to date. Although the value of sequence motifs for classifying and annotating protein sequences is well established, sequence motifs have not been widely applied to predicting interfacial residues in macromolecular complexes. Here, we propose a novel sequence motif-based method for "partner-specific" interfacial residue prediction. Given a specific protein-RNA pair, the goal is to simultaneously predict RNA binding residues in the protein sequence and protein-binding residues in the RNA sequence. In 5-fold cross validation experiments, our method, PS-PRIP, achieved 92% Specificity and 61% Sensitivity, with a Matthews correlation coefficient (MCC) of 0.58 in predicting RNA-binding sites in proteins. The method achieved 69% Specificity and 75% Sensitivity, but with a low MCC of 0.13 in predicting protein binding sites in RNAs. Similar performance results were obtained when PS-PRIP was tested on two independent "blind" datasets of experimentally validated protein- RNA interactions, suggesting the method should be widely applicable and valuable for identifying potential interfacial residues in protein-RNA complexes for which structural information is not available. The PS-PRIP webserver and datasets are available at: http://pridb.gdcb.iastate.edu/PSPRIP/.


Assuntos
Proteínas de Ligação a RNA/química , Proteínas de Ligação a RNA/metabolismo , RNA/química , RNA/metabolismo , Motivos de Aminoácidos , Sequência de Aminoácidos , Sequência de Bases , Sítios de Ligação/genética , Biologia Computacional/métodos , Biologia Computacional/estatística & dados numéricos , Bases de Dados de Ácidos Nucleicos/estatística & dados numéricos , Bases de Dados de Proteínas/estatística & dados numéricos , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Modelos Moleculares , Ligação Proteica , RNA/genética , RNA Bacteriano/química , RNA Bacteriano/genética , RNA Bacteriano/metabolismo , RNA Ribossômico 16S/química , RNA Ribossômico 16S/genética , RNA Ribossômico 16S/metabolismo , Proteínas de Ligação a RNA/genética , Proteínas Ribossômicas/química , Proteínas Ribossômicas/genética , Proteínas Ribossômicas/metabolismo , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA