Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 25
Filtrar
1.
Transfus Med Hemother ; 46(5): 312-325, 2019 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-31832057

RESUMO

The advent of next generation sequencing (NGS) has altered the face of genotyping the human leukocyte antigen (HLA) system in clinical, stem cell donor registry, and research contexts. NGS has led to a dramatically increased sequencing throughput at high accuracy, while being more time and cost efficient than precursor technologies. This has led to a broader and deeper profiling of the key genes in the human immunogenetic make-up. The rapid evolution of sequencing technologies is evidenced by the development of varied short-read sequencing platforms with differing read lengths and sequencing capacities to long-read sequencing platforms capable of profiling full genes without fragmentation. Concomitantly, there has been development of a diverse set of computational analyses and software tools developed to deal with the various strengths and limitations of the sequencing data generated by the different sequencing platforms. This review surveys the different modalities involved in generating NGS HLA profiling sequence data. It systematically describes various computational approaches that have been developed to achieve HLA genotyping to different degrees of resolution. At each stage, this review enumerates the drawbacks and advantages of each of the platforms and analysis approaches, thus providing a comprehensive picture of the current state of HLA genotyping technologies.

2.
Nat Methods ; 9(4): 360-2, 2012 Feb 12.
Artigo em Inglês | MEDLINE | ID: mdl-22327834

RESUMO

Whereas methods to comprehensively study cellular roles of protein-coding genes are available, techniques to systematically investigate long noncoding RNAs (lncRNAs), which have been implicated in diverse biological pathways, are limited. Here we report combined knockdown and localization analysis of noncoding RNAs (c-KLAN) that merges functional characterization and localization approaches to study lncRNAs. Using this technique we identified transcripts that regulate mouse embryonic stem cell identity.


Assuntos
Interferência de RNA , RNA não Traduzido/genética , RNA não Traduzido/metabolismo , Animais , Células-Tronco Embrionárias/metabolismo , Técnicas de Silenciamento de Genes , Hibridização in Situ Fluorescente , Camundongos
3.
Nucleic Acids Res ; 41(2): e37, 2013 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-23143104

RESUMO

Targeted genome engineering has become an important research area for diverse disciplines, with site-specific recombinases (SSRs) being among the most popular genome engineering tools. Their ability to trigger excision, integration, inversion and translocation has made SSRs an invaluable tool to manipulate DNA in vitro and in vivo. However, sophisticated strategies that combine different SSR systems are ever increasing. Hence, the demand for additional precise and efficient recombinases is dictated by the increasing complexity of the genetic studies. Here, we describe a novel site-specific recombination system designated Vika/vox. Vika originates from a degenerate bacteriophage of Vibrio coralliilyticus and shares low sequence similarity to other tyrosine recombinases, but functionally carries out a similar type of reaction. We demonstrate that Vika is highly specific in catalyzing vox recombination without recombining target sites from other SSR systems. We also compare the recombination activity of Vika/vox with other SSR systems, providing a guideline for deciding on the most suitable enzyme for a particular application and demonstrate that Vika expression does not cause cytotoxicity in mammalian cells. Our results show that Vika/vox is a novel powerful and safe instrument in the 'genetic toolbox' that can be used alone or in combination with other SSRs in heterologous hosts.


Assuntos
Recombinases/metabolismo , Recombinação Genética , Sequência de Aminoácidos , Animais , Sítios de Ligação Microbiológicos , Bacteriófagos/enzimologia , Biologia Computacional/métodos , Escherichia coli/genética , Engenharia Genética , Células HeLa , Humanos , Integrases/química , Camundongos , Modelos Moleculares , Dados de Sequência Molecular , Recombinases/química , Tirosina
4.
BMC Bioinformatics ; 15: 263, 2014 Aug 06.
Artigo em Inglês | MEDLINE | ID: mdl-25096057

RESUMO

BACKGROUND: Searching the orthologs of a given protein or DNA sequence is one of the most important and most commonly used Bioinformatics methods in Biology. Programs like BLAST or the orthology search engine Inparanoid can be used to find orthologs when the similarity between two sequences is sufficiently high. They however fail when the level of conservation is low. The detection of remotely conserved proteins oftentimes involves sophisticated manual intervention that is difficult to automate. RESULTS: Here, we introduce morFeus, a search program to find remotely conserved orthologs. Based on relaxed sequence similarity searches, morFeus selects sequences based on the similarity of their alignments to the query, tests for orthology by iterative reciprocal BLAST searches and calculates a network score for the resulting network of orthologs that is a measure of orthology independent of the E-value. Detecting remotely conserved orthologs of a protein using morFeus thus requires no manual intervention. We demonstrate the performance of morFeus by comparing it to state-of-the-art orthology resources and methods. We provide an example of remotely conserved orthologs, which were experimentally shown to be functionally equivalent in the respective organisms and therefore meet the criteria of the orthology-function conjecture. CONCLUSIONS: Based on our results, we conclude that morFeus is a powerful and specific search method for detecting remotely conserved orthologs. morFeus is freely available at http://bio.biochem.mpg.de/morfeus/. Its source code is available from Sourceforge.net (https://sourceforge.net/p/morfeus/).


Assuntos
Biologia Computacional/métodos , Sequência Conservada , Internet , Homologia de Sequência , Software , Sequência de Aminoácidos , Sequência de Bases , Interpretação Estatística de Dados , Humanos , Análise de Sequência
5.
Nat Cell Biol ; 9(12): 1401-12, 2007 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-17994010

RESUMO

Cell division is fundamental for all organisms. Here we report a genome-scale RNA-mediated interference screen in HeLa cells designed to identify human genes that are important for cell division. We have used a library of endoribonuclease-prepared short interfering RNAs for gene silencing and have used DNA content analysis to identify genes that induced cell cycle arrest or altered ploidy on silencing. Validation and secondary assays were performed to generate a nine-parameter loss-of-function phenoprint for each of the genes. These phenotypic signatures allowed the assignment of genes to specific functional classes by combining hierarchical clustering, cross-species analysis and proteomic data mining. We highlight the richness of our dataset by ascribing novel functions to genes in mitosis and cytokinesis. In particular, we identify two evolutionarily conserved transcriptional regulatory networks that govern cytokinesis. Our work provides an experimental framework from which the systematic analysis of novel genes necessary for cell division in human cells can begin.


Assuntos
Divisão Celular/fisiologia , Genoma Humano , Interferência de RNA , Perfilação da Expressão Gênica , Células HeLa , Humanos , RNA Interferente Pequeno/metabolismo
6.
Methods Mol Biol ; 2809: 157-169, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38907897

RESUMO

The Immuno Polymorphism Database (IPD) plays a pivotal role for immunogenetics. Due to technical limitations, genotyping often focuses on specific key regions like the antigen recognition domain (ARD) for HLA genotyping, and the databases are populated accordingly. More recently, though, modern next generation sequencing (NGS) assays allow using larger gene segments or even complete genes for genotyping. It is therefore essential that the databases are updated with complete genetic reference sequences to fully serve current and future applications. However, the process of manually annotating and submitting full-length allele sequences to IPD is time-consuming and error-prone, which may discourage HLA-genotyping laboratories or researchers from submitting full-length sequences of novel alleles.Here, we detail the process of preparing and submitting novel HLA, MIC, and KIR alleles to ENA and IPD using TypeLoader2, a convenient software tool developed to streamline this process by automating the sequence annotation, the creation of all necessary files, as well as parts of the submission process itself. The software is freely available from GitHub ( https://github.com/DKMS-LSL/typeloader ).


Assuntos
Alelos , Antígenos HLA , Sequenciamento de Nucleotídeos em Larga Escala , Receptores KIR , Software , Humanos , Receptores KIR/genética , Antígenos HLA/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Bases de Dados Genéticas , Biologia Computacional/métodos , Genótipo , Polimorfismo Genético
7.
Mol Biol Evol ; 28(10): 2935-48, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-21546354

RESUMO

Pentatricopeptide repeat (PPR) proteins are the largest known RNA-binding protein family, and are found in all eukaryotes, being particularly abundant in higher plants. PPR proteins localize mostly to mitochondria and chloroplasts, and many were shown to modulate organellar genome expression on the posttranscriptional level. Although the genomes of land plants encode hundreds of PPR proteins, only a few have been identified in Fungi and Metazoa. As the current PPR motif profiles are built mainly on the basis of the predominant plant sequences, they are unlikely to be optimal for detecting fungal and animal members of the family, and many putative PPR proteins in these genomes may remain undetected. In order to verify this hypothesis, we designed a hidden Markov model-based bioinformatic tool called Supervised Clustering-based Iterative Phylogenetic Hidden Markov Model algorithm for the Evaluation of tandem Repeat motif families (SCIPHER) using sequence data from orthologous clusters from available yeast genomes. This approach allowed us to assign 12 new proteins in Saccharomyces cerevisiae to the PPR family. Similarly, in other yeast species, we obtained a 5-fold increase in the detection of PPR motifs, compared with the previous tools. All the newly identified S. cerevisiae PPR proteins localize in the mitochondrion and are a part of the RNA processing interaction network. Furthermore, the yeast PPR proteins seem to undergo an accelerated divergent evolution. Analysis of single and double amino acid substitutions in the Dmr1 protein of S. cerevisiae suggests that cooperative interactions between motifs and pseudoreversion could be the force driving this rapid evolution.


Assuntos
Algoritmos , Evolução Molecular , Genômica/métodos , Cadeias de Markov , Proteínas de Ligação a RNA/genética , Proteínas de Saccharomyces cerevisiae/genética , Sequência de Aminoácidos , Análise por Conglomerados , Genoma Mitocondrial , Dados de Sequência Molecular , Filogenia , Alinhamento de Sequência
8.
Nucleic Acids Res ; 38(Web Server issue): W293-8, 2010 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-20529878

RESUMO

Site-specific recombinases have become a resourceful tool for genome engineering, allowing sophisticated in vivo DNA modifications and rearrangements, including the precise removal of integrated retroviruses from host genomes. In a recent study, a mutant form of Cre recombinase has been used to excise the provirus of a specific HIV-1 strain from the human genome. To achieve provirus excision, the Cre recombinase had to be evolved to recombine an asymmetric locus of recombination (lox)-like sequence present in the long terminal repeat (LTR) regions of a HIV-1 strain. One pre-requisite for this type of work is the identification of degenerate lox-like sites in genomic sequences. Given their nature-two inverted repeats flanking a spacer of variable length-existing search tools like BLAST or RepeatMasker perform poorly. To address this lack of available algorithms, we have developed the web-server SeLOX, which can identify degenerate lox-like sites within genomic sequences. SeLOX calculates a position weight matrix based on lox-like sequences, which is used to search genomic sequences. For computational efficiency, we transform sequences into binary space, which allows us to use a bit-wise AND Boolean operator for comparisons. Next to finding lox-like sites for Cre type recombinases in HIV LTR sequences, we have used SeLOX to identify lox-like sites in HIV LTRs for six yeast recombinases. We finally demonstrate the general usefulness of SeLOX in identifying lox-like sequences in large genomes by searching Cre type recombination sites in the entire human genome. SeLOX is freely available at http://selox.mpi-cbg.de/cgi-bin/selox/index.


Assuntos
Repetição Terminal Longa de HIV , Integrases/metabolismo , Sequências Repetidas Invertidas , Recombinases/metabolismo , Recombinação Genética , Software , Sítios de Ligação , Evolução Molecular Direcionada , Genoma Humano , Genômica , Humanos , Internet
9.
Proc Natl Acad Sci U S A ; 106(7): 2136-41, 2009 Feb 17.
Artigo em Inglês | MEDLINE | ID: mdl-19174513

RESUMO

Although the transcriptome, proteome, and interactome of several eukaryotic model organisms have been described in detail, lipidomes remain relatively uncharacterized. Using Saccharomyces cerevisiae as an example, we demonstrate that automated shotgun lipidomics analysis enabled lipidome-wide absolute quantification of individual molecular lipid species by streamlined processing of a single sample of only 2 million yeast cells. By comparative lipidomics, we achieved the absolute quantification of 250 molecular lipid species covering 21 major lipid classes. This analysis provided approximately 95% coverage of the yeast lipidome achieved with 125-fold improvement in sensitivity compared with previous approaches. Comparative lipidomics demonstrated that growth temperature and defects in lipid biosynthesis induce ripple effects throughout the molecular composition of the yeast lipidome. This work serves as a resource for molecular characterization of eukaryotic lipidomes, and establishes shotgun lipidomics as a powerful platform for complementing biochemical studies and other systems-level approaches.


Assuntos
Lipídeos/química , Espectrometria de Massas/métodos , Saccharomyces cerevisiae/metabolismo , Esfingolipídeos/química , Ácidos Graxos/química , Proteínas Fúngicas/química , Regulação Fúngica da Expressão Gênica , Genoma Fúngico , Metabolismo dos Lipídeos , Redes e Vias Metabólicas , Modelos Biológicos , Modelos Teóricos , Temperatura
10.
HLA ; 93(4): 195-202, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-30821128

RESUMO

The Immuno Polymorphism Database (IPD) databases provide global, curated repositories for information regarding polymorphisms of genes of the immune system, thereby generating immense value for the research and clinical communities. The advent of high-throughput genotyping in immunogenetics has led to dramatically growing numbers of heretofore unknown HLA and lately also killer-cell immunoglobulin-like receptor (KIR) alleles, which are to be curated and deposited in the IPD-IMGT/HLA and IPD-KIR databases, respectively. It is highly desirable that these novel alleles are characterised and submitted in full length, and that known alleles are extended to cover the complete gene sequence. However, the manual annotation and submission of sequences to European Molecular Biology Laboratory's European Nucleotide Archive and the IPD-IMGT/HLA and IPD-KIR databases is time-consuming and error-prone. Here, we report the substantial extension of the HLA allele submission tool TypeLoader, which now also supports the annotation and submission of KIR alleles. To enable a more widespread use of this tool, we have made it available as a stand-alone application that can easily be installed on standard Windows or Linux computers. Furthermore, an internal SQLite database was added to store a wide range of metadata about each allele. This allows TypeLoader2 to be used as a lab's central information platform for the annotation, curation and submission of full-length HLA and KIR allele sequences. The software is freely available from GitHub (https://github.com/DKMS-LSL/typeloader). We hope that the increased convenience and scope of TypeLoader2 will foster the submission of more full-length sequences to the IPD-IMGT/HLA and IPD-KIR databases, ultimately promoting the use of full-length sequencing for genotyping both HLA and KIR.


Assuntos
Alelos , Bases de Dados Genéticas , Antígenos HLA/genética , Sequenciamento de Nucleotídeos em Larga Escala , Polimorfismo Genético , Receptores KIR/genética , Software , Humanos
11.
Methods Mol Biol ; 1802: 155-162, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-29858807

RESUMO

Nanopore sequencing, a paradigm change in sequencing technologies, offers a new cost-effective and scalable platform for HLA genotyping. Among the new generation of high-throughput sequencing technologies, the MinION nanopore sequencer is the first to offer a non-template-based direct DNA sensing sequencing technology. Oxford Nanopore Technologies (ONT) introduced the first version of the MinION in 2014; since then, the platform has gone through multiple iterations resulting in higher throughput and sequencing accuracy. The "what you put in is what you get" nature of the platform enables molecules to be sequenced without fragmentation. This results in ultra-long read lengths in the order of tens of kilobases enabling entire genes to be characterized with fully phased sequence information. With release R9.5, the MinION platform has reached a quality that enables HLA genotyping with minor shortcomings in long homopolymer regions. Within this chapter, we describe a protocol for sequencing and genotyping HLA Class I alleles using the MinION.


Assuntos
Técnicas de Genotipagem/métodos , Sequenciamento de Nucleotídeos em Larga Escala/instrumentação , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Antígenos de Histocompatibilidade Classe I/genética , Nanoporos , Biblioteca Gênica , Humanos
12.
BMC Bioinformatics ; 7: 466, 2006 Oct 23.
Artigo em Inglês | MEDLINE | ID: mdl-17059594

RESUMO

BACKGROUND: The functional annotation of proteins relies on published information concerning their close and remote homologues in sequence databases. Evidence for remote sequence similarity can be further strengthened by a similar biological background of the query sequence and identified database sequences. However, few tools exist so far, that provide a means to include functional information in sequence database searches. RESULTS: We present ProFAT, a web-based tool for the functional annotation of protein sequences based on remote sequence similarity. ProFAT combines sensitive sequence database search methods and a fold recognition algorithm with a simple text-mining approach. ProFAT extracts identified hits based on their biological background by keyword-mining of annotations, features and most importantly, literature associated with a sequence entry. A user-provided keyword list enables the user to specifically search for weak, but biologically relevant homologues of an input query. The ProFAT server has been evaluated using the complete set of proteins from three different domain families, including their weak relatives and could correctly identify between 90% and 100% of all domain family members studied in this context. ProFAT has furthermore been applied to a variety of proteins from different cellular contexts and we provide evidence on how ProFAT can help in functional prediction of proteins based on remotely conserved proteins. CONCLUSION: By employing sensitive database search programs as well as exploiting the functional information associated with database sequences, ProFAT can detect remote, but biologically relevant relationships between proteins and will assist researchers in the prediction of protein function based on remote homologies.


Assuntos
Sequência de Aminoácidos , Biologia Computacional/métodos , Bases de Dados de Proteínas , Internet , Proteínas/química , Software , Algoritmos , Cadeias de Markov , Estrutura Terciária de Proteína , Homologia de Sequência de Aminoácidos , Homologia Estrutural de Proteína
13.
Nucleic Acids Res ; 32(Database issue): D497-501, 2004 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-14681466

RESUMO

The rapid pace at which genomic and proteomic data is being generated necessitates the development of tools and resources for managing data that allow integration of information from disparate sources. The Human Protein Reference Database (http://www.hprd.org) is a web-based resource based on open source technologies for protein information about several aspects of human proteins including protein-protein interactions, post-translational modifications, enzyme-substrate relationships and disease associations. This information was derived manually by a critical reading of the published literature by expert biologists and through bioinformatics analyses of the protein sequence. This database will assist in biomedical discoveries by serving as a resource of genomic and proteomic information and providing an integrated view of sequence, structure, function and protein networks in health and disease.


Assuntos
Bases de Dados de Proteínas , Proteínas/metabolismo , Proteômica , Biologia Computacional , Doença , Genômica , Humanos , Armazenamento e Recuperação da Informação , Internet , Ligação Proteica , Processamento de Proteína Pós-Traducional , Proteínas/química , Proteínas/genética , Proteoma/química , Proteoma/genética , Proteoma/metabolismo , Especificidade por Substrato , Vocabulário Controlado
14.
Nat Biotechnol ; 34(4): 401-9, 2016 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-26900663

RESUMO

Current combination antiretroviral therapies (cART) efficiently suppress HIV-1 reproduction in humans, but the virus persists as integrated proviral reservoirs in small numbers of cells. To generate an antiviral agent capable of eradicating the provirus from infected cells, we employed 145 cycles of substrate-linked directed evolution to evolve a recombinase (Brec1) that site-specifically recognizes a 34-bp sequence present in the long terminal repeats (LTRs) of the majority of the clinically relevant HIV-1 strains and subtypes. Brec1 efficiently, precisely and safely removes the integrated provirus from infected cells and is efficacious on clinical HIV-1 isolates in vitro and in vivo, including in mice humanized with patient-derived cells. Our data suggest that Brec1 has potential for clinical application as a curative HIV-1 therapy.


Assuntos
Antivirais/farmacologia , Evolução Molecular Direcionada/métodos , Infecções por HIV/virologia , HIV-1/efeitos dos fármacos , Provírus/efeitos dos fármacos , Recombinases/farmacologia , Integração Viral/efeitos dos fármacos , Animais , Antivirais/metabolismo , Sequência de Bases , Células Cultivadas , HIV-1/genética , Humanos , Camundongos , Dados de Sequência Molecular , Provírus/genética , Recombinases/metabolismo , Integração Viral/genética
15.
Elife ; 42015 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-25838129

RESUMO

mRNA localization is critical for eukaryotic cells and affects numerous transcripts, yet how cells regulate distribution of many mRNAs to their subcellular destinations is still unknown. We combined transcriptomics and systematic imaging to determine the tissue-specific expression and subcellular distribution of 5862 mRNAs during Drosophila oogenesis. mRNA localization is widespread in the ovary and detectable in all of its cell types-the somatic epithelial, the nurse cells, and the oocyte. Genes defined by a common RNA localization share distinct gene features and differ in expression level, 3'UTR length and sequence conservation from unlocalized mRNAs. Comparison of mRNA localizations in different contexts revealed that localization of individual mRNAs changes over time in the oocyte and between ovarian and embryonic cell types. This genome scale image-based resource (Dresden Ovary Table, DOT, http://tomancak-srv1.mpi-cbg.de/DOT/main.html) enables the transition from mechanistic dissection of singular mRNA localization events towards global understanding of how mRNAs transcribed in the nucleus distribute in cells.


Assuntos
Drosophila melanogaster/crescimento & desenvolvimento , Drosophila melanogaster/genética , Imageamento Tridimensional , Transporte de RNA/genética , Animais , Núcleo Celular/metabolismo , Feminino , Regulação da Expressão Gênica no Desenvolvimento , Hibridização in Situ Fluorescente , Oogênese/genética , Ovário/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Fatores de Tempo
16.
BMC Bioinformatics ; 5: 43, 2004 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-15099404

RESUMO

BACKGROUND: The explosion in biological information creates the need for databases that are easy to develop, easy to maintain and can be easily manipulated by annotators who are most likely to be biologists. However, deployment of scalable and extensible databases is not an easy task and generally requires substantial expertise in database development. RESULTS: BioBuilder is a Zope-based software tool that was developed to facilitate intuitive creation of protein databases. Protein data can be entered and annotated through web forms along with the flexibility to add customized annotation features to protein entries. A built-in review system permits a global team of scientists to coordinate their annotation efforts. We have already used BioBuilder to develop Human Protein Reference Database http://www.hprd.org, a comprehensive annotated repository of the human proteome. The data can be exported in the extensible markup language (XML) format, which is rapidly becoming as the standard format for data exchange. CONCLUSIONS: As the proteomic data for several organisms begins to accumulate, BioBuilder will prove to be an invaluable platform for functional annotation and development of customizable protein centric databases. BioBuilder is open source and is available under the terms of LGPL.


Assuntos
Sistemas de Gerenciamento de Base de Dados , Bases de Dados de Proteínas , Proteínas/fisiologia , Software , Biologia Computacional/métodos , Biologia Computacional/normas , Bases de Dados de Proteínas/normas , Humanos , Internet , Proteínas/normas , Software/normas , Design de Software
17.
J Int AIDS Soc ; 17(4 Suppl 3): 19706, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25397454

RESUMO

Current drugs against HIV can suppress the progression to AIDS but cannot clear the patient from the virus. Because of potential side effects of these drugs and the possible development of drug resistance, finding a cure for HIV infection remains a high priority of HIV/AIDS research. We recently generated a recombinase (termed Tre) tailored to efficiently eradicate the provirus from the host genome of HIV-1 infected cells by specifically targeting a sequence that is present in the long terminal repeats (LTRs) of the viral DNA [1]. In vivo analyses in HIV-infected humanized mice demonstrated highly significant antiviral effects of Tre recombinase [2]. However, the fact that Tre recognizes a particular HIV-1 subtype A strain may limit its broad therapeutic application. To advance our Tre-based strategy towards a universally efficient cure, we have engineered a new, universal recombinase (uTre) applicable to the majority of HIV-1 infections by the various virus strains and subtypes. We employed the search tool SeLOX [3] in order to find a well-conserved HIV-1 proviral sequence that could serve as target site for a universal Tre from sequences compiled in the Los Alamos HIV Sequence Database. We selected a candidate (termed loxLTRu) with a mean conservation rate of 94% throughout the major HIV-1 subtype groups A, B and C. We applied loxLTRu as substrate in our established substrate-linked protein evolution (SLiPE) process [4] and evolved the uTre recombinase in 142 evolution cycles. Highly specific enzymatic activity on loxLTRu is demonstrated for uTre in both Escherichia coli and human cells. Naturally occurring viral variants with single mutations within the loxLTRu sequence are also shown to be efficiently targeted by uTre, further increasing the range of applicability of the recombinase. Potential off-target sites in the human genome are not recombined by uTre. Furthermore, uTre expression in primary human T cells shows no obvious Tre-related cytopathic or genotoxic effects. Finally, uTre expressing mice show no undesired phenotypes during their normal lifespan. We have developed a broad-range HIV-1 LTR specific recombinase that has the potential to be effective against the vast majority of HIV-1 strains and to cure HIV-1 infected cells from the infection. These results strongly encouraged us in our confidence that a Tre recombinase-mediated HIV eradication strategy may become a valuable component of a future therapy for HIV-infected patients.

18.
Methods Mol Biol ; 942: 193-204, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23027053

RESUMO

RNA interference (RNAi) has grown to be one of the main techniques for loss-of-function studies, leading to the elucidation of biological function of genes in various cellular systems and model organisms. While for many invertebrates such as Drosophila melanogaster (D. melanogaster) and Caenorhabditis elegans (C. elegans) long double-stranded RNA (dsRNA) can directly be used to induce a RNAi response, chemically synthesized small interfering RNAs (siRNAs) are typically employed in mammalian cells to avoid an interferon-like response triggered by long dsRNA (Reynolds et al., RNA 12:988-993, 2006). However, siRNAs are expensive and beset with unintentional gene targeting effects (off-targets) confounding the analysis of results from such studies. We, and others, have developed an alternative technology for RNAi in mammalian cells, termed endoribonuclease-prepared siRNA (esiRNA), which is based on the enzymatic generation of siRNA pools by digestion of long dsRNAs with recombinant RNase III in vitro (Yang et al., Proc Natl Acad Sci USA 99: 9942-9947, 2002; Myers et al., Nat Biotechnol 21:324-328; 2003). This technology has proven to be cost-efficient and reliable. Furthermore, several studies have demonstrated that complex pools of siRNAs, as inherent in esiRNAs, which target one transcript reduce off-target effects (Myers et al., J RNAi Gene Silencing 2:181, 2006; Kittler et al., Nat Methods 4:337-344, 2007). Within this chapter we describe design criteria for the generation of target-optimized esiRNAs.


Assuntos
Endorribonucleases/metabolismo , Técnicas de Silenciamento de Genes/métodos , RNA Interferente Pequeno/genética , Regiões 3' não Traduzidas/genética , Biologia Computacional , RNA Interferente Pequeno/metabolismo
19.
PLoS One ; 6(3): e17568, 2011 Mar 10.
Artigo em Inglês | MEDLINE | ID: mdl-21423752

RESUMO

Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes. We find ∼4500 cross-species validated, remotely conserved domain predictions in the human proteome alone. As an example, we find a DNA-binding domain in the C-terminal part of the A-kinase anchor protein 10 (AKAP10), a PKA adaptor that has been implicated in cardiac arrhythmias and premature cardiac death, which upon stress likely translocates from mitochondria to the nucleus/nucleolus. Based on our prediction, we propose that with this HLH-domain, AKAP10 is involved in the transcriptional control of stress response. Further remotely conserved domains we discuss are examples from areas such as sporulation, chromosome segregation and signalling during immune response. The HMMerThread algorithm is able to automatically detect the presence of remotely conserved domains in proteins based on weak sequence similarity. Our predictions open up new avenues for biological and medical studies. Genome-wide HMMerThread domains are available at http://vm1-hmmerthread.age.mpg.de.


Assuntos
Biologia Computacional/métodos , Sequência Conservada , Bases de Dados de Proteínas , Genoma/genética , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Software , Algoritmos , Sequência de Aminoácidos , Animais , Divisão Celular , Doença , Humanos , Dados de Sequência Molecular , Ligação Proteica , Proteoma/análise , Proteoma/química , Alinhamento de Sequência , Análise de Sequência de Proteína
20.
J Proteome Res ; 7(8): 3382-95, 2008 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-18558732

RESUMO

Only a small fraction of spectra acquired in LC-MS/MS runs matches peptides from target proteins upon database searches. The remaining, operationally termed background, spectra originate from a variety of poorly controlled sources and affect the throughput and confidence of database searches. Here, we report an algorithm and its software implementation that rapidly removes background spectra, regardless of their precise origin. The method estimates the dissimilarity distance between screened MS/MS spectra and unannotated spectra from a partially redundant background library compiled from several control and blank runs. Filtering MS/MS queries enhanced the protein identification capacity when searches lacked spectrum to sequence matching specificity. In sequence-similarity searches it reduced by, on average, 30-fold the number of orphan hits, which were not explicitly related to background protein contaminants and required manual validation. Removing high quality background MS/MS spectra, while preserving in the data set the genuine spectra from target proteins, decreased the false positive rate of stringent database searches and improved the identification of low-abundance proteins.


Assuntos
Proteínas/análise , Animais , Cromatografia Líquida , Bases de Dados Factuais , Células HeLa , Humanos , Proteínas de Insetos/análise , Proteínas de Plantas/análise , Proteômica , Software , Espectrometria de Massas em Tandem , Traqueófitas , Triatoma
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA