Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 25
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
1.
Methods Mol Biol ; 2809: 157-169, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38907897

RESUMEN

The Immuno Polymorphism Database (IPD) plays a pivotal role for immunogenetics. Due to technical limitations, genotyping often focuses on specific key regions like the antigen recognition domain (ARD) for HLA genotyping, and the databases are populated accordingly. More recently, though, modern next generation sequencing (NGS) assays allow using larger gene segments or even complete genes for genotyping. It is therefore essential that the databases are updated with complete genetic reference sequences to fully serve current and future applications. However, the process of manually annotating and submitting full-length allele sequences to IPD is time-consuming and error-prone, which may discourage HLA-genotyping laboratories or researchers from submitting full-length sequences of novel alleles.Here, we detail the process of preparing and submitting novel HLA, MIC, and KIR alleles to ENA and IPD using TypeLoader2, a convenient software tool developed to streamline this process by automating the sequence annotation, the creation of all necessary files, as well as parts of the submission process itself. The software is freely available from GitHub ( https://github.com/DKMS-LSL/typeloader ).


Asunto(s)
Alelos , Antígenos HLA , Secuenciación de Nucleótidos de Alto Rendimiento , Receptores KIR , Programas Informáticos , Humanos , Receptores KIR/genética , Antígenos HLA/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Bases de Datos Genéticas , Biología Computacional/métodos , Genotipo , Polimorfismo Genético
2.
Transfus Med Hemother ; 46(5): 312-325, 2019 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-31832057

RESUMEN

The advent of next generation sequencing (NGS) has altered the face of genotyping the human leukocyte antigen (HLA) system in clinical, stem cell donor registry, and research contexts. NGS has led to a dramatically increased sequencing throughput at high accuracy, while being more time and cost efficient than precursor technologies. This has led to a broader and deeper profiling of the key genes in the human immunogenetic make-up. The rapid evolution of sequencing technologies is evidenced by the development of varied short-read sequencing platforms with differing read lengths and sequencing capacities to long-read sequencing platforms capable of profiling full genes without fragmentation. Concomitantly, there has been development of a diverse set of computational analyses and software tools developed to deal with the various strengths and limitations of the sequencing data generated by the different sequencing platforms. This review surveys the different modalities involved in generating NGS HLA profiling sequence data. It systematically describes various computational approaches that have been developed to achieve HLA genotyping to different degrees of resolution. At each stage, this review enumerates the drawbacks and advantages of each of the platforms and analysis approaches, thus providing a comprehensive picture of the current state of HLA genotyping technologies.

3.
HLA ; 93(4): 195-202, 2019 04.
Artículo en Inglés | MEDLINE | ID: mdl-30821128

RESUMEN

The Immuno Polymorphism Database (IPD) databases provide global, curated repositories for information regarding polymorphisms of genes of the immune system, thereby generating immense value for the research and clinical communities. The advent of high-throughput genotyping in immunogenetics has led to dramatically growing numbers of heretofore unknown HLA and lately also killer-cell immunoglobulin-like receptor (KIR) alleles, which are to be curated and deposited in the IPD-IMGT/HLA and IPD-KIR databases, respectively. It is highly desirable that these novel alleles are characterised and submitted in full length, and that known alleles are extended to cover the complete gene sequence. However, the manual annotation and submission of sequences to European Molecular Biology Laboratory's European Nucleotide Archive and the IPD-IMGT/HLA and IPD-KIR databases is time-consuming and error-prone. Here, we report the substantial extension of the HLA allele submission tool TypeLoader, which now also supports the annotation and submission of KIR alleles. To enable a more widespread use of this tool, we have made it available as a stand-alone application that can easily be installed on standard Windows or Linux computers. Furthermore, an internal SQLite database was added to store a wide range of metadata about each allele. This allows TypeLoader2 to be used as a lab's central information platform for the annotation, curation and submission of full-length HLA and KIR allele sequences. The software is freely available from GitHub (https://github.com/DKMS-LSL/typeloader). We hope that the increased convenience and scope of TypeLoader2 will foster the submission of more full-length sequences to the IPD-IMGT/HLA and IPD-KIR databases, ultimately promoting the use of full-length sequencing for genotyping both HLA and KIR.


Asunto(s)
Alelos , Bases de Datos Genéticas , Antígenos HLA/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Polimorfismo Genético , Receptores KIR/genética , Programas Informáticos , Humanos
4.
Methods Mol Biol ; 1802: 155-162, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-29858807

RESUMEN

Nanopore sequencing, a paradigm change in sequencing technologies, offers a new cost-effective and scalable platform for HLA genotyping. Among the new generation of high-throughput sequencing technologies, the MinION nanopore sequencer is the first to offer a non-template-based direct DNA sensing sequencing technology. Oxford Nanopore Technologies (ONT) introduced the first version of the MinION in 2014; since then, the platform has gone through multiple iterations resulting in higher throughput and sequencing accuracy. The "what you put in is what you get" nature of the platform enables molecules to be sequenced without fragmentation. This results in ultra-long read lengths in the order of tens of kilobases enabling entire genes to be characterized with fully phased sequence information. With release R9.5, the MinION platform has reached a quality that enables HLA genotyping with minor shortcomings in long homopolymer regions. Within this chapter, we describe a protocol for sequencing and genotyping HLA Class I alleles using the MinION.


Asunto(s)
Técnicas de Genotipaje/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Antígenos de Histocompatibilidad Clase I/genética , Nanoporos , Biblioteca de Genes , Humanos
5.
Nat Biotechnol ; 34(4): 401-9, 2016 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-26900663

RESUMEN

Current combination antiretroviral therapies (cART) efficiently suppress HIV-1 reproduction in humans, but the virus persists as integrated proviral reservoirs in small numbers of cells. To generate an antiviral agent capable of eradicating the provirus from infected cells, we employed 145 cycles of substrate-linked directed evolution to evolve a recombinase (Brec1) that site-specifically recognizes a 34-bp sequence present in the long terminal repeats (LTRs) of the majority of the clinically relevant HIV-1 strains and subtypes. Brec1 efficiently, precisely and safely removes the integrated provirus from infected cells and is efficacious on clinical HIV-1 isolates in vitro and in vivo, including in mice humanized with patient-derived cells. Our data suggest that Brec1 has potential for clinical application as a curative HIV-1 therapy.


Asunto(s)
Antivirales/farmacología , Evolución Molecular Dirigida/métodos , Infecciones por VIH/virología , VIH-1/efectos de los fármacos , Provirus/efectos de los fármacos , Recombinasas/farmacología , Integración Viral/efectos de los fármacos , Animales , Antivirales/metabolismo , Secuencia de Bases , Células Cultivadas , VIH-1/genética , Humanos , Ratones , Datos de Secuencia Molecular , Provirus/genética , Recombinasas/metabolismo , Integración Viral/genética
6.
Elife ; 42015 Apr 02.
Artículo en Inglés | MEDLINE | ID: mdl-25838129

RESUMEN

mRNA localization is critical for eukaryotic cells and affects numerous transcripts, yet how cells regulate distribution of many mRNAs to their subcellular destinations is still unknown. We combined transcriptomics and systematic imaging to determine the tissue-specific expression and subcellular distribution of 5862 mRNAs during Drosophila oogenesis. mRNA localization is widespread in the ovary and detectable in all of its cell types-the somatic epithelial, the nurse cells, and the oocyte. Genes defined by a common RNA localization share distinct gene features and differ in expression level, 3'UTR length and sequence conservation from unlocalized mRNAs. Comparison of mRNA localizations in different contexts revealed that localization of individual mRNAs changes over time in the oocyte and between ovarian and embryonic cell types. This genome scale image-based resource (Dresden Ovary Table, DOT, http://tomancak-srv1.mpi-cbg.de/DOT/main.html) enables the transition from mechanistic dissection of singular mRNA localization events towards global understanding of how mRNAs transcribed in the nucleus distribute in cells.


Asunto(s)
Drosophila melanogaster/crecimiento & desarrollo , Drosophila melanogaster/genética , Imagenología Tridimensional , Transporte de ARN/genética , Animales , Núcleo Celular/metabolismo , Femenino , Regulación del Desarrollo de la Expresión Génica , Hibridación Fluorescente in Situ , Oogénesis/genética , Ovario/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismo , Factores de Tiempo
7.
J Int AIDS Soc ; 17(4 Suppl 3): 19706, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25397454

RESUMEN

Current drugs against HIV can suppress the progression to AIDS but cannot clear the patient from the virus. Because of potential side effects of these drugs and the possible development of drug resistance, finding a cure for HIV infection remains a high priority of HIV/AIDS research. We recently generated a recombinase (termed Tre) tailored to efficiently eradicate the provirus from the host genome of HIV-1 infected cells by specifically targeting a sequence that is present in the long terminal repeats (LTRs) of the viral DNA [1]. In vivo analyses in HIV-infected humanized mice demonstrated highly significant antiviral effects of Tre recombinase [2]. However, the fact that Tre recognizes a particular HIV-1 subtype A strain may limit its broad therapeutic application. To advance our Tre-based strategy towards a universally efficient cure, we have engineered a new, universal recombinase (uTre) applicable to the majority of HIV-1 infections by the various virus strains and subtypes. We employed the search tool SeLOX [3] in order to find a well-conserved HIV-1 proviral sequence that could serve as target site for a universal Tre from sequences compiled in the Los Alamos HIV Sequence Database. We selected a candidate (termed loxLTRu) with a mean conservation rate of 94% throughout the major HIV-1 subtype groups A, B and C. We applied loxLTRu as substrate in our established substrate-linked protein evolution (SLiPE) process [4] and evolved the uTre recombinase in 142 evolution cycles. Highly specific enzymatic activity on loxLTRu is demonstrated for uTre in both Escherichia coli and human cells. Naturally occurring viral variants with single mutations within the loxLTRu sequence are also shown to be efficiently targeted by uTre, further increasing the range of applicability of the recombinase. Potential off-target sites in the human genome are not recombined by uTre. Furthermore, uTre expression in primary human T cells shows no obvious Tre-related cytopathic or genotoxic effects. Finally, uTre expressing mice show no undesired phenotypes during their normal lifespan. We have developed a broad-range HIV-1 LTR specific recombinase that has the potential to be effective against the vast majority of HIV-1 strains and to cure HIV-1 infected cells from the infection. These results strongly encouraged us in our confidence that a Tre recombinase-mediated HIV eradication strategy may become a valuable component of a future therapy for HIV-infected patients.

8.
BMC Bioinformatics ; 15: 263, 2014 Aug 06.
Artículo en Inglés | MEDLINE | ID: mdl-25096057

RESUMEN

BACKGROUND: Searching the orthologs of a given protein or DNA sequence is one of the most important and most commonly used Bioinformatics methods in Biology. Programs like BLAST or the orthology search engine Inparanoid can be used to find orthologs when the similarity between two sequences is sufficiently high. They however fail when the level of conservation is low. The detection of remotely conserved proteins oftentimes involves sophisticated manual intervention that is difficult to automate. RESULTS: Here, we introduce morFeus, a search program to find remotely conserved orthologs. Based on relaxed sequence similarity searches, morFeus selects sequences based on the similarity of their alignments to the query, tests for orthology by iterative reciprocal BLAST searches and calculates a network score for the resulting network of orthologs that is a measure of orthology independent of the E-value. Detecting remotely conserved orthologs of a protein using morFeus thus requires no manual intervention. We demonstrate the performance of morFeus by comparing it to state-of-the-art orthology resources and methods. We provide an example of remotely conserved orthologs, which were experimentally shown to be functionally equivalent in the respective organisms and therefore meet the criteria of the orthology-function conjecture. CONCLUSIONS: Based on our results, we conclude that morFeus is a powerful and specific search method for detecting remotely conserved orthologs. morFeus is freely available at http://bio.biochem.mpg.de/morfeus/. Its source code is available from Sourceforge.net (https://sourceforge.net/p/morfeus/).


Asunto(s)
Biología Computacional/métodos , Secuencia Conservada , Internet , Homología de Secuencia , Programas Informáticos , Secuencia de Aminoácidos , Secuencia de Bases , Interpretación Estadística de Datos , Humanos , Análisis de Secuencia
9.
Methods Mol Biol ; 942: 193-204, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23027053

RESUMEN

RNA interference (RNAi) has grown to be one of the main techniques for loss-of-function studies, leading to the elucidation of biological function of genes in various cellular systems and model organisms. While for many invertebrates such as Drosophila melanogaster (D. melanogaster) and Caenorhabditis elegans (C. elegans) long double-stranded RNA (dsRNA) can directly be used to induce a RNAi response, chemically synthesized small interfering RNAs (siRNAs) are typically employed in mammalian cells to avoid an interferon-like response triggered by long dsRNA (Reynolds et al., RNA 12:988-993, 2006). However, siRNAs are expensive and beset with unintentional gene targeting effects (off-targets) confounding the analysis of results from such studies. We, and others, have developed an alternative technology for RNAi in mammalian cells, termed endoribonuclease-prepared siRNA (esiRNA), which is based on the enzymatic generation of siRNA pools by digestion of long dsRNAs with recombinant RNase III in vitro (Yang et al., Proc Natl Acad Sci USA 99: 9942-9947, 2002; Myers et al., Nat Biotechnol 21:324-328; 2003). This technology has proven to be cost-efficient and reliable. Furthermore, several studies have demonstrated that complex pools of siRNAs, as inherent in esiRNAs, which target one transcript reduce off-target effects (Myers et al., J RNAi Gene Silencing 2:181, 2006; Kittler et al., Nat Methods 4:337-344, 2007). Within this chapter we describe design criteria for the generation of target-optimized esiRNAs.


Asunto(s)
Endorribonucleasas/metabolismo , Técnicas de Silenciamiento del Gen/métodos , ARN Interferente Pequeño/genética , Regiones no Traducidas 3'/genética , Biología Computacional , ARN Interferente Pequeño/metabolismo
10.
Nucleic Acids Res ; 41(2): e37, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23143104

RESUMEN

Targeted genome engineering has become an important research area for diverse disciplines, with site-specific recombinases (SSRs) being among the most popular genome engineering tools. Their ability to trigger excision, integration, inversion and translocation has made SSRs an invaluable tool to manipulate DNA in vitro and in vivo. However, sophisticated strategies that combine different SSR systems are ever increasing. Hence, the demand for additional precise and efficient recombinases is dictated by the increasing complexity of the genetic studies. Here, we describe a novel site-specific recombination system designated Vika/vox. Vika originates from a degenerate bacteriophage of Vibrio coralliilyticus and shares low sequence similarity to other tyrosine recombinases, but functionally carries out a similar type of reaction. We demonstrate that Vika is highly specific in catalyzing vox recombination without recombining target sites from other SSR systems. We also compare the recombination activity of Vika/vox with other SSR systems, providing a guideline for deciding on the most suitable enzyme for a particular application and demonstrate that Vika expression does not cause cytotoxicity in mammalian cells. Our results show that Vika/vox is a novel powerful and safe instrument in the 'genetic toolbox' that can be used alone or in combination with other SSRs in heterologous hosts.


Asunto(s)
Recombinasas/metabolismo , Recombinación Genética , Secuencia de Aminoácidos , Animales , Sitios de Ligazón Microbiológica , Bacteriófagos/enzimología , Biología Computacional/métodos , Escherichia coli/genética , Ingeniería Genética , Células HeLa , Humanos , Integrasas/química , Ratones , Modelos Moleculares , Datos de Secuencia Molecular , Recombinasas/química , Tirosina
11.
Nat Methods ; 9(4): 360-2, 2012 Feb 12.
Artículo en Inglés | MEDLINE | ID: mdl-22327834

RESUMEN

Whereas methods to comprehensively study cellular roles of protein-coding genes are available, techniques to systematically investigate long noncoding RNAs (lncRNAs), which have been implicated in diverse biological pathways, are limited. Here we report combined knockdown and localization analysis of noncoding RNAs (c-KLAN) that merges functional characterization and localization approaches to study lncRNAs. Using this technique we identified transcripts that regulate mouse embryonic stem cell identity.


Asunto(s)
Interferencia de ARN , ARN no Traducido/genética , ARN no Traducido/metabolismo , Animales , Células Madre Embrionarias/metabolismo , Técnicas de Silenciamiento del Gen , Hibridación Fluorescente in Situ , Ratones
12.
Mol Biol Evol ; 28(10): 2935-48, 2011 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-21546354

RESUMEN

Pentatricopeptide repeat (PPR) proteins are the largest known RNA-binding protein family, and are found in all eukaryotes, being particularly abundant in higher plants. PPR proteins localize mostly to mitochondria and chloroplasts, and many were shown to modulate organellar genome expression on the posttranscriptional level. Although the genomes of land plants encode hundreds of PPR proteins, only a few have been identified in Fungi and Metazoa. As the current PPR motif profiles are built mainly on the basis of the predominant plant sequences, they are unlikely to be optimal for detecting fungal and animal members of the family, and many putative PPR proteins in these genomes may remain undetected. In order to verify this hypothesis, we designed a hidden Markov model-based bioinformatic tool called Supervised Clustering-based Iterative Phylogenetic Hidden Markov Model algorithm for the Evaluation of tandem Repeat motif families (SCIPHER) using sequence data from orthologous clusters from available yeast genomes. This approach allowed us to assign 12 new proteins in Saccharomyces cerevisiae to the PPR family. Similarly, in other yeast species, we obtained a 5-fold increase in the detection of PPR motifs, compared with the previous tools. All the newly identified S. cerevisiae PPR proteins localize in the mitochondrion and are a part of the RNA processing interaction network. Furthermore, the yeast PPR proteins seem to undergo an accelerated divergent evolution. Analysis of single and double amino acid substitutions in the Dmr1 protein of S. cerevisiae suggests that cooperative interactions between motifs and pseudoreversion could be the force driving this rapid evolution.


Asunto(s)
Algoritmos , Evolución Molecular , Genómica/métodos , Cadenas de Markov , Proteínas de Unión al ARN/genética , Proteínas de Saccharomyces cerevisiae/genética , Secuencia de Aminoácidos , Análisis por Conglomerados , Genoma Mitocondrial , Datos de Secuencia Molecular , Filogenia , Alineación de Secuencia
13.
PLoS One ; 6(3): e17568, 2011 Mar 10.
Artículo en Inglés | MEDLINE | ID: mdl-21423752

RESUMEN

Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes. We find ∼4500 cross-species validated, remotely conserved domain predictions in the human proteome alone. As an example, we find a DNA-binding domain in the C-terminal part of the A-kinase anchor protein 10 (AKAP10), a PKA adaptor that has been implicated in cardiac arrhythmias and premature cardiac death, which upon stress likely translocates from mitochondria to the nucleus/nucleolus. Based on our prediction, we propose that with this HLH-domain, AKAP10 is involved in the transcriptional control of stress response. Further remotely conserved domains we discuss are examples from areas such as sporulation, chromosome segregation and signalling during immune response. The HMMerThread algorithm is able to automatically detect the presence of remotely conserved domains in proteins based on weak sequence similarity. Our predictions open up new avenues for biological and medical studies. Genome-wide HMMerThread domains are available at http://vm1-hmmerthread.age.mpg.de.


Asunto(s)
Biología Computacional/métodos , Secuencia Conservada , Bases de Datos de Proteínas , Genoma/genética , Estructura Secundaria de Proteína , Estructura Terciaria de Proteína , Programas Informáticos , Algoritmos , Secuencia de Aminoácidos , Animales , División Celular , Enfermedad , Humanos , Datos de Secuencia Molecular , Unión Proteica , Proteoma/análisis , Proteoma/química , Alineación de Secuencia , Análisis de Secuencia de Proteína
14.
Nucleic Acids Res ; 38(Web Server issue): W293-8, 2010 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-20529878

RESUMEN

Site-specific recombinases have become a resourceful tool for genome engineering, allowing sophisticated in vivo DNA modifications and rearrangements, including the precise removal of integrated retroviruses from host genomes. In a recent study, a mutant form of Cre recombinase has been used to excise the provirus of a specific HIV-1 strain from the human genome. To achieve provirus excision, the Cre recombinase had to be evolved to recombine an asymmetric locus of recombination (lox)-like sequence present in the long terminal repeat (LTR) regions of a HIV-1 strain. One pre-requisite for this type of work is the identification of degenerate lox-like sites in genomic sequences. Given their nature-two inverted repeats flanking a spacer of variable length-existing search tools like BLAST or RepeatMasker perform poorly. To address this lack of available algorithms, we have developed the web-server SeLOX, which can identify degenerate lox-like sites within genomic sequences. SeLOX calculates a position weight matrix based on lox-like sequences, which is used to search genomic sequences. For computational efficiency, we transform sequences into binary space, which allows us to use a bit-wise AND Boolean operator for comparisons. Next to finding lox-like sites for Cre type recombinases in HIV LTR sequences, we have used SeLOX to identify lox-like sites in HIV LTRs for six yeast recombinases. We finally demonstrate the general usefulness of SeLOX in identifying lox-like sequences in large genomes by searching Cre type recombination sites in the entire human genome. SeLOX is freely available at http://selox.mpi-cbg.de/cgi-bin/selox/index.


Asunto(s)
Duplicado del Terminal Largo de VIH , Integrasas/metabolismo , Secuencias Invertidas Repetidas , Recombinasas/metabolismo , Recombinación Genética , Programas Informáticos , Sitios de Unión , Evolución Molecular Dirigida , Genoma Humano , Genómica , Humanos , Internet
15.
Proc Natl Acad Sci U S A ; 106(7): 2136-41, 2009 Feb 17.
Artículo en Inglés | MEDLINE | ID: mdl-19174513

RESUMEN

Although the transcriptome, proteome, and interactome of several eukaryotic model organisms have been described in detail, lipidomes remain relatively uncharacterized. Using Saccharomyces cerevisiae as an example, we demonstrate that automated shotgun lipidomics analysis enabled lipidome-wide absolute quantification of individual molecular lipid species by streamlined processing of a single sample of only 2 million yeast cells. By comparative lipidomics, we achieved the absolute quantification of 250 molecular lipid species covering 21 major lipid classes. This analysis provided approximately 95% coverage of the yeast lipidome achieved with 125-fold improvement in sensitivity compared with previous approaches. Comparative lipidomics demonstrated that growth temperature and defects in lipid biosynthesis induce ripple effects throughout the molecular composition of the yeast lipidome. This work serves as a resource for molecular characterization of eukaryotic lipidomes, and establishes shotgun lipidomics as a powerful platform for complementing biochemical studies and other systems-level approaches.


Asunto(s)
Lípidos/química , Espectrometría de Masas/métodos , Saccharomyces cerevisiae/metabolismo , Esfingolípidos/química , Ácidos Grasos/química , Proteínas Fúngicas/química , Regulación Fúngica de la Expresión Génica , Genoma Fúngico , Metabolismo de los Lípidos , Redes y Vías Metabólicas , Modelos Biológicos , Modelos Teóricos , Temperatura
16.
J Proteome Res ; 7(8): 3382-95, 2008 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-18558732

RESUMEN

Only a small fraction of spectra acquired in LC-MS/MS runs matches peptides from target proteins upon database searches. The remaining, operationally termed background, spectra originate from a variety of poorly controlled sources and affect the throughput and confidence of database searches. Here, we report an algorithm and its software implementation that rapidly removes background spectra, regardless of their precise origin. The method estimates the dissimilarity distance between screened MS/MS spectra and unannotated spectra from a partially redundant background library compiled from several control and blank runs. Filtering MS/MS queries enhanced the protein identification capacity when searches lacked spectrum to sequence matching specificity. In sequence-similarity searches it reduced by, on average, 30-fold the number of orphan hits, which were not explicitly related to background protein contaminants and required manual validation. Removing high quality background MS/MS spectra, while preserving in the data set the genuine spectra from target proteins, decreased the false positive rate of stringent database searches and improved the identification of low-abundance proteins.


Asunto(s)
Proteínas/análisis , Animales , Cromatografía Liquida , Bases de Datos Factuales , Células HeLa , Humanos , Proteínas de Insectos/análisis , Proteínas de Plantas/análisis , Proteómica , Programas Informáticos , Espectrometría de Masas en Tándem , Tracheophyta , Triatoma
17.
Nat Cell Biol ; 9(12): 1401-12, 2007 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-17994010

RESUMEN

Cell division is fundamental for all organisms. Here we report a genome-scale RNA-mediated interference screen in HeLa cells designed to identify human genes that are important for cell division. We have used a library of endoribonuclease-prepared short interfering RNAs for gene silencing and have used DNA content analysis to identify genes that induced cell cycle arrest or altered ploidy on silencing. Validation and secondary assays were performed to generate a nine-parameter loss-of-function phenoprint for each of the genes. These phenotypic signatures allowed the assignment of genes to specific functional classes by combining hierarchical clustering, cross-species analysis and proteomic data mining. We highlight the richness of our dataset by ascribing novel functions to genes in mitosis and cytokinesis. In particular, we identify two evolutionarily conserved transcriptional regulatory networks that govern cytokinesis. Our work provides an experimental framework from which the systematic analysis of novel genes necessary for cell division in human cells can begin.


Asunto(s)
División Celular/fisiología , Genoma Humano , Interferencia de ARN , Perfilación de la Expresión Génica , Células HeLa , Humanos , ARN Interferente Pequeño/metabolismo
18.
Proteomics ; 7(14): 2318-29, 2007 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-17623296

RESUMEN

LC-MS/MS analysis on a linear ion trap LTQ mass spectrometer, combined with data processing, stringent, and sequence-similarity database searching tools, was employed in a layered manner to identify proteins in organisms with unsequenced genomes. Highly specific stringent searches (MASCOT) were applied as a first layer screen to identify either known (i.e. present in a database) proteins, or unknown proteins sharing identical peptides with related database sequences. Once the confidently matched spectra were removed, the remainder was filtered against a nonannotated library of background spectra that cleaned up the dataset from spectra of common protein and chemical contaminants. The rectified spectral dataset was further subjected to rapid batch de novo interpretation by PepNovo software, followed by the MS BLAST sequence-similarity search that used multiple redundant and partially accurate candidate peptide sequences. Importantly, a single dataset was acquired at the uncompromised sensitivity with no need of manual selection of MS/MS spectra for subsequent de novo interpretation. This approach enabled a completely automated identification of novel proteins that were, otherwise, missed by conventional database searches.


Asunto(s)
Cromatografía Liquida/métodos , Genoma , Proteómica/métodos , Homología de Secuencia de Aminoácido , Espectrometría de Masas en Tándem/métodos , Proteínas Algáceas/química , Proteínas Algáceas/metabolismo , Secuencia de Aminoácidos , Chlorophyta , Cromatografía Liquida/instrumentación , Simulación por Computador , Bases de Datos de Proteínas , Proteínas de la Membrana/química , Proteínas de la Membrana/metabolismo , Datos de Secuencia Molecular , Proteómica/instrumentación , Programas Informáticos , Espectrometría de Masas en Tándem/instrumentación
19.
Anal Chem ; 79(11): 4083-93, 2007 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-17474710

RESUMEN

Direct profiling of total lipid extracts on a hybrid LTQ Orbitrap mass spectrometer by high-resolution survey spectra clusters species of 11 major lipid classes into 7 groups, which are distinguished by their sum compositions and could be identified by accurately determined masses. Rapid acquisition of survey spectra was employed as a "top-down" screening tool that, together with the computational method of principal component analysis, revealed pronounced perturbations in the abundance of lipid precursors within the entire series of experiments. Altered lipid precursors were subsequently identified either by accurately determined masses or by in-depth MS/MS characterization that was performed on the same instrument. Hence, the sensitivity, throughput and robustness of lipidomics screens were improved without compromising the accuracy and specificity of molecular species identification. The top-down lipidomics strategy lends itself for high-throughput screens complementing ongoing functional genomics efforts.


Asunto(s)
Lípidos/análisis , Lípidos/química , Espectrometría de Masas en Tándem/métodos , Animales , Caenorhabditis elegans/química , Caenorhabditis elegans/genética , Caenorhabditis elegans/crecimiento & desarrollo , Estructura Molecular , Interferencia de ARN
20.
Nat Methods ; 4(4): 337-44, 2007 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-17351622

RESUMEN

RNA interference (RNAi) has become an important technique for loss-of-gene-function studies in mammalian cells. To achieve reliable results in an RNAi experiment, efficient and specific silencing triggers are required. Here we present genome-wide data sets for the production of endoribonuclease-prepared short interfering RNAs (esiRNAs) for human, mouse and rat. We used an algorithm to predict the optimal region for esiRNA synthesis for every protein-coding gene of these three species. We created a database, RiDDLE, for retrieval of target sequences and primer information. To test this in silico resource experimentally, we generated 16,242 esiRNAs that can be used for RNAi screening in human cells. Comparative analyses with chemically synthesized siRNAs demonstrated a high silencing efficacy of esiRNAs and a 12-fold reduction of downregulated off-target transcripts as detected by microarray analysis. Hence, the presented esiRNA libraries offer an efficient, cost-effective and specific alternative to presently available mammalian RNAi resources.


Asunto(s)
Endorribonucleasas/genética , Biblioteca Genómica , Genómica/métodos , Interferencia de ARN , ARN Interferente Pequeño/genética , Animales , Humanos , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Sensibilidad y Especificidad , Transcripción Genética , Transfección , Regiones no Traducidas , Interfaz Usuario-Computador
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...