Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
PLoS One ; 6(8): e23786, 2011.
Article in English | MEDLINE | ID: mdl-21887319

ABSTRACT

Schistosoma japonicum is a major infectious agent of schistosomiasis. It has been reported that large number of proteins excreted and secreted by S. japonicum during its life cycle are important for its infection and survival in definitive hosts. These proteins can be used as ideal candidates for vaccines or drug targets. In this work, we analyzed the protein sequences of S. japonicum and found that compared with other proteins in S. japonicum, excretory/secretory (ES) proteins are generally longer, more likely to be stable and enzyme, more likely to contain immune-related binding peptides and more likely to be involved in regulation and metabolism processes. Based on the sequence difference between ES and non-ES proteins, we trained a support vector machine (SVM) with much higher accuracy than existing approaches. Using this SVM, we identified 191 new ES proteins in S. japonicum, and further predicted 7 potential interactions between these ES proteins and human immune proteins. Our results are useful to understand the pathogenesis of schistosomiasis and can serve as a new resource for vaccine or drug targets discovery for anti-schistosome.


Subject(s)
Helminth Proteins/analysis , Host-Pathogen Interactions/immunology , Immune System/microbiology , Schistosoma japonicum/immunology , Secretory Pathway/immunology , Amino Acid Sequence , Animals , Helminth Proteins/immunology , Helminth Proteins/metabolism , Humans , Proteomics/methods , Schistosoma japonicum/chemistry , Schistosoma japonicum/pathogenicity , Schistosomiasis japonica , Support Vector Machine
2.
J Bioinform Comput Biol ; 9(2): 299-316, 2011 Apr.
Article in English | MEDLINE | ID: mdl-21523934

ABSTRACT

Exon expression profiling technologies, including exon arrays and RNA-Seq, measure the abundance of every exon in a gene. Compared with gene expression profiling technologies like 3' array, exon expression profiling technologies could detect alterations in both transcription and alternative splicing, therefore they are expected to be more sensitive in diagnosis. However, exon expression profiling also brings higher dimension, more redundancy, and significant correlation among features. Ignoring the correlation structure among exons of a gene, a popular classification method like L1-SVM selects exons individually from each gene and thus is vulnerable to noise. To overcome this limitation, we present in this paper a new variant of SVM named Lex-SVM to incorporate correlation structure among exons and known splicing patterns to promote classification performance. Specifically, we construct a new norm, ex-norm, including our prior knowledge on exon correlation structure to regularize the coefficients of a linear SVM. Lex-SVM can be solved efficiently using standard linear programming techniques. The advantage of Lex-SVM is that it can select features group-wisely, force features in a subgroup to take equal weihts and exclude the features that contradict the majority in the subgroup. Experimental results suggest that on exon expression profile, Lex-SVM is more accurate than existing methods. Lex-SVM also generates a more compact model and selects genes more consistently in cross-validation. Unlike L1-SVM selecting only one exon in a gene, Lex-SVM assigns equal weights to as many exons in a gene as possible, lending itself easier for further interpretation.


Subject(s)
Artificial Intelligence , Disease/classification , Disease/genetics , Exons , Gene Expression Profiling/statistics & numerical data , Alternative Splicing , Computational Biology , Databases, Nucleic Acid/statistics & numerical data , Humans , Mathematical Concepts , Models, Genetic , Neoplasms/classification , Neoplasms/genetics , Programming, Linear
3.
BMC Bioinformatics ; 12 Suppl 1: S54, 2011 Feb 15.
Article in English | MEDLINE | ID: mdl-21342587

ABSTRACT

BACKGROUND: Native structures of proteins are formed essentially due to the combining effects of local and distant (in the sense of sequence) interactions among residues. These interaction information are, explicitly or implicitly, encoded into the scoring function in protein structure prediction approaches--threading approaches usually measure an alignment in the sense that how well a sequence adopts an existing structure; while the energy functions in Ab Initio methods are designed to measure how likely a conformation is near-native. Encouraging progress has been observed in structure refinement where knowledge-based or physics-based potentials are designed to capture distant interactions. Thus, it is interesting to investigate whether distant interaction information captured by the Ab Initio energy function can be used to improve threading, especially for the weakly/distant homologous templates. RESULTS: In this paper, we investigate the possibility to improve alignment-generating through incorporating distant interaction information into the alignment scoring function in a nontrivial approach. Specifically, the distant interaction information is introduced through employing an Ab Initio energy function to evaluate the "partial" decoy built from an alignment. Subsequently, a local search algorithm is utilized to optimize the scoring function.Experimental results demonstrate that with distant interaction items, the quality of generated alignments are improved on 68 out of 127 query-template pairs in Prosup benchmark. In addition, compared with state-to-art threading methods, our method performs better on alignment accuracy comparison. CONCLUSIONS: Incorporating Ab Initio energy functions into threading can greatly improve alignment accuracy.


Subject(s)
Algorithms , Proteins/chemistry , Sequence Alignment/methods , Computational Biology/methods , Models, Statistical , Protein Conformation , Software
4.
Nucleic Acids Res ; 39(9): 3864-78, 2011 May.
Article in English | MEDLINE | ID: mdl-21247874

ABSTRACT

Although accumulating evidence has provided insight into the various functions of long-non-coding RNAs (lncRNAs), the exact functions of the majority of such transcripts are still unknown. Here, we report the first computational annotation of lncRNA functions based on public microarray expression profiles. A coding-non-coding gene co-expression (CNC) network was constructed from re-annotated Affymetrix Mouse Genome Array data. Probable functions for altogether 340 lncRNAs were predicted based on topological or other network characteristics, such as module sharing, association with network hubs and combinations of co-expression and genomic adjacency. The functions annotated to the lncRNAs mainly involve organ or tissue development (e.g. neuron, eye and muscle development), cellular transport (e.g. neuronal transport and sodium ion, acid or lipid transport) or metabolic processes (e.g. involving macromolecules, phosphocreatine and tyrosine).


Subject(s)
Gene Regulatory Networks , RNA, Untranslated/physiology , Animals , Gene Expression Profiling , Genomics , Mice , Molecular Sequence Annotation , Nucleic Acid Probes/chemistry , Oligonucleotide Array Sequence Analysis , RNA, Untranslated/metabolism , Synaptic Transmission/genetics
5.
BMC Syst Biol ; 3: 65, 2009 Jun 26.
Article in English | MEDLINE | ID: mdl-19558649

ABSTRACT

BACKGROUND: MicroRNAs (miRNAs), a growing class of small RNAs with crucial regulatory roles at the post-transcriptional level, are usually found to be clustered on chromosomes. However, with the exception of a few individual cases, so far little is known about the functional consequence of this conserved clustering of miRNA loci. In animal genomes such clusters often contain non-homologous miRNA genes. One hypothesis to explain this heterogeneity suggests that clustered miRNAs are functionally related by virtue of co-targeting downstream pathways. RESULTS: Integrating of miRNA cluster information with protein protein interaction (PPI) network data, our research supports the hypothesis of the functional coordination of clustered miRNAs and links it to the topological features of miRNAs' targets in PPI network. Specifically, our results demonstrate that clustered miRNAs jointly regulate proteins in close proximity of the PPI network. The possibility that two proteins yield to this coordinated regulation is negatively correlated with their distance in PPI network. Guided by the knowledge of this preference, we found several network communities enriched with target genes of miRNA clusters. In addition, our results demonstrate that the variance of this propensity can also partly be explained by protein's connectivity and miRNA's conservation. CONCLUSION: In summary, this work supports the hypothesis of intra-cluster coordination and investigates the extent of this coordination.


Subject(s)
MicroRNAs/metabolism , Proteins/metabolism , Animals , Humans , Mice , Protein Binding , Protein Interaction Mapping , Rats
SELECTION OF CITATIONS
SEARCH DETAIL
...