Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 74
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
BMC Bioinformatics ; 21(1): 528, 2020 Nov 17.
Artigo em Inglês | MEDLINE | ID: mdl-33203354

RESUMO

BACKGROUND: Next-generation sequencing technologies revolutionized genomics by producing high-throughput reads at low cost, and this progress has prompted the recent development of de novo assemblers. Multiple assembly methods based on de Bruijn graph have been shown to be efficient for Illumina reads. However, the sequencing errors generated by the sequencer complicate analysis of de novo assembly and influence the quality of downstream genomic researches. RESULTS: In this paper, we develop a de Bruijn assembler, called Clover (clustering-oriented de novo assembler), that utilizes a novel k-mer clustering approach from the overlap-layout-consensus concept to deal with the sequencing errors generated by the Illumina platform. We further evaluate Clover's performance against several de Bruijn graph assemblers (ABySS, SOAPdenovo, SPAdes and Velvet), overlap-layout-consensus assemblers (Bambus2, CABOG and MSR-CA) and string graph assembler (SGA) on three datasets (Staphylococcus aureus, Rhodobacter sphaeroides and human chromosome 14). The results show that Clover achieves a superior assembly quality in terms of corrected N50 and E-size while remaining a significantly competitive in run time except SOAPdenovo. In addition, Clover was involved in the sequencing projects of bacterial genomes Acinetobacter baumannii TYTH-1 and Morganella morganii KT. CONCLUSIONS: The marvel clustering-based approach of Clover that integrates the flexibility of the overlap-layout-consensus approach and the efficiency of the de Bruijn graph method has high potential on de novo assembly. Now, Clover is freely available as open source software from https://oz.nthu.edu.tw/~d9562563/src.html .


Assuntos
Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala , Sequência de Bases , Cromossomos Humanos Par 14/genética , Análise por Conglomerados , Genoma Bacteriano , Genômica/métodos , Humanos , Software , Fatores de Tempo
2.
Bioinformatics ; 35(10): 1677-1685, 2019 05 15.
Artigo em Inglês | MEDLINE | ID: mdl-30321266

RESUMO

MOTIVATION: High-throughput sequencing technology has revolutionized the study of metagenomics and cancer evolution. In a relatively simple environment, a metagenomics sequencing data is dominated by a few species. By analyzing the alignment of reads from microbial species, single nucleotide polymorphisms can be discovered and the evolutionary history of the populations can be reconstructed. The ever-increasing read length will allow more detailed analysis about the evolutionary history of microbial or tumor cell population. A simulator of shotgun sequences from such populations will be helpful in the development or evaluation of analysis algorithms. RESULTS: Here, we described an efficient algorithm, MetaSMC, which simulates reads from evolving microbial populations. Based on the coalescent theory, our simulator supports all evolutionary scenarios supported by other coalescent simulators. In addition, the simulator supports various substitution models, including Jukes-Cantor, HKY85 and generalized time-reversible models. The simulator also supports mutator phenotypes by allowing different mutation rates and substitution models in different subpopulations. Our algorithm ignores unnecessary chromosomal segments and thus is more efficient than standard coalescent when recombination is frequent. We showed that the process behind our algorithm is equivalent to Sequentially Markov Coalescent with an incomplete sample. The accuracy of our algorithm was evaluated by summary statistics and likelihood curves derived from Monte Carlo integration over large number of random genealogies. AVAILABILITY AND IMPLEMENTATION: MetaSMC is written in C. The source code is available at https://github.com/tarjxvf/metasmc. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Genética Populacional , Software , Algoritmos , Sequência de Bases , Sequenciamento de Nucleotídeos em Larga Escala , Metagenômica , Análise de Sequência de DNA
3.
Appl Microbiol Biotechnol ; 99(6): 2871-81, 2015 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-25359480

RESUMO

In this study, we applied a 16S ribosomal RNA (rRNA) metagenomics approach to survey inanimate hospital environments (IHEs) in a respiratory care center (RCC). A total of 16 samples, including 9 from medical devices and 7 from workstations, were analyzed. Besides, clinical isolates were retrospectively analyzed during the sampling period in the RCC. A high amount of microbial diversity was detected, with an average of 1,836 phylotypes per sample. In addition to Acinetobacter, more than 60 % of the bacterial communities present among the top 25 abundant genera were dominated by skin-associated bacteria. Differences in bacterial profiles were restricted to individual samples. Furthermore, compliance with hand hygiene guidelines may be unsatisfactory among hospital staff according to a principal coordinate analysis that indicated clustering of bacterial communities between devices and workstations for most of the sampling sites. Compared to the high incidence of clinical isolates in the RCC, only Staphylococcus and Acinetobacter were highly abundant in the IHEs. Despite Acinetobacter was the most abundant genus present in IHEs of the RCC, potential pathogens, e.g., Acinetobacter baumannii, might remain susceptible to carbapenem. This study is the first in Taiwan to demonstrate a high diversity of human-associated bacteria in the RCC via 16S rRNA metagenomics, which allows for new assessment of potential health risks in RCCs, aids in the evaluation of existing sanitation protocols, and furthers our understanding of the development of healthcare-associated infections.


Assuntos
Bactérias/classificação , Bactérias/efeitos dos fármacos , Metagenômica/métodos , Acinetobacter baumannii/classificação , Acinetobacter baumannii/efeitos dos fármacos , Alelos , Biomassa , Carbapenêmicos/farmacologia , Chryseobacterium/classificação , Chryseobacterium/efeitos dos fármacos , DNA Bacteriano/genética , Farmacorresistência Bacteriana Múltipla , Enterococcus/classificação , Enterococcus/efeitos dos fármacos , Contaminação de Equipamentos , Fômites/microbiologia , Humanos , Klebsiella pneumoniae/classificação , Klebsiella pneumoniae/efeitos dos fármacos , Testes de Sensibilidade Microbiana , Pseudomonas aeruginosa/classificação , Pseudomonas aeruginosa/efeitos dos fármacos , RNA Ribossômico 16S/genética , Análise de Sequência de DNA , Staphylococcus/classificação , Staphylococcus/efeitos dos fármacos , Taiwan
4.
Genomics ; 104(3): 215-23, 2014 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-25072866

RESUMO

We investigated the prevalence of a type IV secretion system (T4SS)-bearing plasmid among clinical isolates of carbapenem-resistant Acinetobacter baumannii (CRAB) using plasmid replicon typing. The complete sequence of a T4SS-bearing plasmid, pAB_CC, isolated from A. baumannii TYTH-1 was determined, and a comparative analysis of the T4SS gene modules was performed. Of the 129 isolates studied, GR6 (repAci6) was the most common (45 of 96 isolates) and was strongly linked with the T4SS. A comparative analysis of the T4SS locus in seven plasmid genomes, including pAB_CC, pACICU2, pABKp1, pABTJ1, p1BJAB0714, p2BJAB0868, and p2ABTCDC0715, indicated that fourteen genes on these plasmids were highly conserved compared to those of the F plasmid. Additionally, the chromosomes in the seven representative isolates may be evolutionarily distinct from their intrinsic T4SS-bearing plasmids, suggesting that the two T4SS lineages emerged long before the appearance of EC II. These two lineages are now widespread in A. baumannii strains.


Assuntos
Acinetobacter baumannii/genética , Sistemas de Secreção Bacterianos/genética , Plasmídeos/genética , Acinetobacter baumannii/efeitos dos fármacos , Sequência de Bases , Carbapenêmicos/farmacologia , Sequência Conservada , Evolução Molecular , Genes Bacterianos , Dados de Sequência Molecular , Filogenia , Plasmídeos/química , Plasmídeos/isolamento & purificação
5.
BMC Genomics ; 15: 815, 2014 Sep 26.
Artigo em Inglês | MEDLINE | ID: mdl-25260865

RESUMO

BACKGROUND: Carbapenem-resistance in Acinetobacter baumannii has gradually become a global challenge. To identify the genes involved in carbapenem resistance in A. baumannii, the transcriptomic responses of the completely sequenced strain ATCC 17978 selected with 0.5 mg/L (IPM-2 m) and 2 mg/L (IPM-8 m) imipenem were investigated using RNA-sequencing to identify differences in the gene expression patterns. RESULTS: A total of 88 and 68 genes were differentially expressed in response to IPM-2 m and IPM-8 m selection, respectively. Among the expressed genes, 50 genes were highly expressed in IPM-2 m, 30 genes were highly expressed in IPM-8 m, and 38 genes were expressed common in both strains. Six groups of genes were simultaneously expressed in IPM-2 m and IPM-8 m mutants. The three gene groups involved in DNA recombination were up-regulated, including recombinase, transposase and DNA repair, and beta-lactamase OXA-95 and homologous recombination. The remaining gene groups involved in biofilm formation were down-regulated, including quorum sensing, secretion systems, and the csu operon. The antibiotic resistance determinants, including RND efflux transporters and multidrug resistance pumps, were over-expressed in response to IPM-2 m selection, followed by a decrease in response to IPM-8 m selection. Among the genes over-expressed in both strains, blaOXA-95, previously clustered with the blaOXA-51-like family, showed 14-fold (IPM-2 m) to 330-fold (IPM-8 m) over-expression. The expression of blaOXA-95 in IPM-2 m and IPM-8 m cells was positively correlated with the rate of imipenem hydrolysis, as demonstrated through Liquid Chromatography-Mass Spectrometry/Mass Spectrometry, suggesting that blaOXA-95 plays a critical role in conferring carbapenem resistance. In addition, A. baumannii shows an inverse relationship between carbapenem resistance and biofilm production. CONCLUSION: Gene recombination and blaOXA-95 play critical roles in carbapenem resistance in A. baumannii. Taken together, the results of the present study provide a foundation for future studies of the network systems associated with carbapenem resistance.


Assuntos
Acinetobacter baumannii/genética , Antibacterianos/farmacologia , Genes Bacterianos , Imipenem/farmacologia , Acinetobacter baumannii/efeitos dos fármacos , Acinetobacter baumannii/metabolismo , Antibacterianos/análise , Antibacterianos/metabolismo , Proteínas de Bactérias/metabolismo , Biofilmes/efeitos dos fármacos , Cromatografia Líquida de Alta Pressão , Perfilação da Expressão Gênica , Hidrólise , Imipenem/análise , Imipenem/metabolismo , Testes de Sensibilidade Microbiana , Espectrometria de Massas em Tandem , Transcriptoma , beta-Lactamases/metabolismo
6.
Nucleic Acids Res ; 40(6): e42, 2012 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-22199257

RESUMO

Next-generation sequencing (NGS) technologies-based transcriptomic profiling method often called RNA-seq has been widely used to study global gene expression, alternative exon usage, new exon discovery, novel transcriptional isoforms and genomic sequence variations. However, this technique also poses many biological and informatics challenges to extracting meaningful biological information. The RNA-seq data analysis is built on the foundation of high quality initial genome localization and alignment information for RNA-seq sequences. Toward this goal, we have developed RNASEQR to accurately and effectively map millions of RNA-seq sequences. We have systematically compared RNASEQR with four of the most widely used tools using a simulated data set created from the Consensus CDS project and two experimental RNA-seq data sets generated from a human glioblastoma patient. Our results showed that RNASEQR yields more accurate estimates for gene expression, complete gene structures and new transcript isoforms, as well as more accurate detection of single nucleotide variants (SNVs). RNASEQR analyzes raw data from RNA-seq experiments effectively and outputs results in a manner that is compatible with a wide variety of specialized downstream analyses on desktop computers.


Assuntos
Perfilação da Expressão Gênica , Análise de Sequência de RNA , Software , Glioblastoma/genética , Humanos , Anotação de Sequência Molecular , Reprodutibilidade dos Testes
7.
BMC Bioinformatics ; 14 Suppl 2: S12, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23368411

RESUMO

BACKGROUND: In the last decade, a considerable amount of research has been devoted to investigating the phylogenetic properties of organisms from a systems-level perspective. Most studies have focused on the classification of organisms based on structural comparison and local alignment of metabolic pathways. In contrast, global alignment of multiple metabolic networks complements sequence-based phylogenetic analyses and provides more comprehensive information. RESULTS: We explored the phylogenetic relationships between microorganisms through global alignment of multiple metabolic networks. The proposed approach integrates sequence homology data with topological information of metabolic networks. In general, compared to recent studies, the resulting trees reflect the living style of organisms as well as classical taxa. Moreover, for phylogenetically closely related organisms, the classification results are consistent with specific metabolic characteristics, such as the light-harvesting systems, fermentation types, and sources of electrons in photosynthesis. CONCLUSIONS: We demonstrate the usefulness of global alignment of multiple metabolic networks to infer phylogenetic relationships between species. In addition, our exhaustive analysis of microbial metabolic pathways reveals differences in metabolic features between phylogenetically closely related organisms. With the ongoing increase in the number of genomic sequences and metabolic annotations, the proposed approach will help identify phenotypic variations that may not be apparent based solely on sequence-based classification.


Assuntos
Genômica , Redes e Vias Metabólicas , Filogenia , Algoritmos , Análise por Conglomerados , Fermentação , Lactobacillus/classificação , Complexos de Proteínas Captadores de Luz/análise , Fotossíntese , Prochlorococcus/classificação , Bactérias Redutoras de Enxofre/classificação , Synechococcus/classificação
8.
BMC Genomics ; 14: 824, 2013 Nov 23.
Artigo em Inglês | MEDLINE | ID: mdl-24267727

RESUMO

BACKGROUND: SUMOylation, as part of the epigenetic regulation of transcription, has been intensively studied in lower eukaryotes that contain only a single SUMO protein; however, the functions of SUMOylation during mammalian epigenetic transcriptional regulation are largely uncharacterized. Mammals express three major SUMO paralogues: SUMO-1, SUMO-2, and SUMO-3 (normally referred to as SUMO-1 and SUMO-2/3). Herpesviruses, including Kaposi's sarcoma associated herpesvirus (KSHV), seem to have evolved mechanisms that directly or indirectly modulate the SUMO machinery in order to evade host immune surveillance, thus advancing their survival. Interestingly, KSHV encodes a SUMO E3 ligase, K-bZIP, with specificity toward SUMO-2/3 and is an excellent model for investigating the global functional differences between SUMO paralogues. RESULTS: We investigated the effect of experimental herpesvirus reactivation in a KSHV infected B lymphoma cell line on genomic SUMO-1 and SUMO-2/3 binding profiles together with the potential role of chromatin SUMOylation in transcription regulation. This was carried out via high-throughput sequencing analysis. Interestingly, chromatin immunoprecipitation sequencing (ChIP-seq) experiments showed that KSHV reactivation is accompanied by a significant increase in SUMO-2/3 modification around promoter regions, but SUMO-1 enrichment was absent. Expression profiling revealed that the SUMO-2/3 targeted genes are primarily highly transcribed genes that show no expression changes during viral reactivation. Gene ontology analysis further showed that these genes are involved in cellular immune responses and cytokine signaling. High-throughput annotation of SUMO occupancy of transcription factor binding sites (TFBS) pinpointed the presence of three master regulators of immune responses, IRF-1, IRF-2, and IRF-7, as potential SUMO-2/3 targeted transcriptional factors after KSHV reactivation. CONCLUSION: Our study is the first to identify differential genome-wide SUMO modifications between SUMO paralogues during herpesvirus reactivation. Our findings indicate that SUMO-2/3 modification near protein-coding gene promoters occurs in order to maintain host immune-related gene unaltered during viral reactivation.


Assuntos
Cromatina/metabolismo , Herpesvirus Humano 8/fisiologia , Proteínas Modificadoras Pequenas Relacionadas à Ubiquitina/metabolismo , Sumoilação , Ubiquitinas/metabolismo , Ativação Viral , Linhagem Celular Tumoral , Cromatina/virologia , Imunoprecipitação da Cromatina , Epigênese Genética/imunologia , Ontologia Genética , Genes MHC da Classe II , Células HEK293 , Humanos , Fatores Reguladores de Interferon/genética , Fatores Reguladores de Interferon/metabolismo , Regiões Promotoras Genéticas , Ligação Proteica , Fatores de Transcrição/metabolismo , Transcriptoma
9.
Bioorg Med Chem Lett ; 23(23): 6286-91, 2013 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-24144850

RESUMO

Damage to DNA is caused by ionizing radiation, genotoxic chemicals or collapsed replication forks. When DNA is damaged or cells fail to respond, a mutation that is associated with breast or ovarian cancer may occur. Mammalian cells control and stabilize the genome using a cell cycle checkpoint to prevent damage to DNA or to repair damaged DNA. Checkpoint kinase 2 (Chk2) is one of the important kinases, which strongly affects DNA-damage and plays an important role in the response to the breakage of DNA double-strands and related lesions. Therefore, this study concerns Chk2. Its purpose is to find potential inhibitors using the pharmacophore hypotheses (PhModels) and virtual screening techniques. PhModels can identify inhibitors with high biological activities and virtual screening techniques are used to screen the database of the National Cancer Institute (NCI) to retrieve compounds that exhibit all of the pharmacophoric features of potential inhibitors with high interaction energy. Ten PhModels were generated using the HypoGen best algorithm. The established PhModel, Hypo01, was evaluated by performing a cost function analysis of its correlation coefficient (r), root mean square deviation (RMSD), cost difference, and configuration cost, with the values 0.955, 1.28, 192.51, and 16.07, respectively. The result of Fischer's cross-validation test for the Hypo01 model yielded a 95% confidence level, and the correlation coefficient of the testing set (rtest) had a best value of 0.81. The potential inhibitors were then chosen from the NCI database by Hypo01 model screening and molecular docking using the cdocker docking program. Finally, the selected compounds exhibited the identified pharmacophoric features and had a high interaction energy between the ligand and the receptor. Eighty-three potential inhibitors for Chk2 are retrieved for further study.


Assuntos
Quinase do Ponto de Checagem 2/antagonistas & inibidores , Inibidores de Proteínas Quinases/química , Inibidores de Proteínas Quinases/farmacologia , Quinase do Ponto de Checagem 2/química , Dano ao DNA , Desenho de Fármacos , Humanos , Modelos Moleculares , Relação Quantitativa Estrutura-Atividade , Relação Estrutura-Atividade , Termodinâmica
10.
J Bacteriol ; 194(24): 6974, 2012 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-23209228

RESUMO

Acinetobacter baumannii has emerged recently as a major cause of health care-associated infections due to the extent of its antimicrobial resistance and its propensity to cause large nosocomial outbreaks. Here we report the genome sequence of Acinetobacter baumannii TYTH-1 isolated in Taiwan during 2008.


Assuntos
Acinetobacter baumannii/genética , Genoma Bacteriano , Infecções por Acinetobacter/microbiologia , Acinetobacter baumannii/efeitos dos fármacos , Acinetobacter baumannii/isolamento & purificação , Antibacterianos/farmacologia , Bacteriemia/microbiologia , Proteínas de Bactérias/genética , DNA Bacteriano/genética , Farmacorresistência Bacteriana Múltipla , Humanos , Dados de Sequência Molecular , RNA Bacteriano/genética , Análise de Sequência de DNA , Taiwan , beta-Lactamases/genética
11.
BMC Genomics ; 13: 157, 2012 Apr 27.
Artigo em Inglês | MEDLINE | ID: mdl-22537274

RESUMO

BACKGROUND: Microorganisms able to grow under artificial culture conditions comprise only a small proportion of the biosphere's total microbial community. Until recently, scientists have been unable to perform thorough analyses of difficult-to-culture microorganisms due to limitations in sequencing technology. As modern techniques have dramatically increased sequencing rates and rapidly expanded the number of sequenced genomes, in addition to traditional taxonomic classifications which focus on the evolutionary relationships of organisms, classifications of the genomes based on alternative points of view may help advance our understanding of the delicate relationships of organisms. RESULTS: We have developed a proteome-based method for classifying microbial species. This classification method uses a set of probes comprising short, highly conserved amino acid sequences. For each genome, in silico translation is performed to obtained its proteome, based on which a probe-set frequency pattern is generated. Then, the probe-set frequency patterns are used to cluster the proteomes/genomes. CONCLUSIONS: Features of the proposed method include a high running speed in challenge of a large number of genomes, and high applicability for classifying organisms with incomplete genome sequences. Moreover, the probe-set clustering method is sensitive to the metabolic phenotypic similarities/differences among species and is thus supposed potential for the classification or differentiation of closely-related organisms.


Assuntos
Bactérias/genética , Bactérias/metabolismo , Genômica/métodos , Transferência Genética Horizontal/genética , Filogenia , Proteômica
12.
BMC Genomics ; 13: 491, 2012 Sep 18.
Artigo em Inglês | MEDLINE | ID: mdl-22988976

RESUMO

BACKGROUND: RNA interference (RNAi) is commonly applied in genome-scale gene functional screens. However, a one-on-one RNAi analysis that targets each gene is cost-ineffective and laborious. Previous studies have indicated that siRNAs can also affect RNAs that are near-perfectly complementary, and this phenomenon has been termed an off-target effect. This phenomenon implies that it is possible to silence several genes simultaneously with a carefully designed siRNA. RESULTS: We propose a strategy that is combined with a heuristic algorithm to design suitable siRNAs that can target multiple genes and a group testing method that would reduce the number of required RNAi experiments in a large-scale RNAi analysis. To verify the efficacy of our strategy, we used the Orchid expressed sequence tag data as a case study to screen the putative transcription factors that are involved in plant disease responses. According to our computation, 94 qualified siRNAs were sufficient to examine all of the predicated 229 transcription factors. In addition, among the 94 computer-designed siRNAs, an siRNA that targets both TF15 (a previously identified transcription factor that is involved in the plant disease-response pathway) and TF21 was introduced into orchids. The experimental results showed that this siRNA can simultaneously silence TF15 and TF21, and application of our strategy successfully confirmed that TF15 is involved in plant defense responses. Interestingly, our second-round analysis, which used an siRNA specific to TF21, indicated that TF21 is a previously unidentified transcription factor that is related to plant defense responses. CONCLUSIONS: Our computational results showed that it is possible to screen all genes with fewer experiments than would be required for the traditional one-on-one RNAi screening. We also verified that our strategy is capable of identifying genes that are involved in a specific phenotype.


Assuntos
Algoritmos , Liliaceae/genética , Proteínas de Plantas/genética , Interferência de RNA , RNA Interferente Pequeno/genética , Fatores de Transcrição/genética , Agrobacterium tumefaciens/genética , Sequência de Bases , Biologia Computacional , Etiquetas de Sequências Expressas , Liliaceae/imunologia , Dados de Sequência Molecular , Doenças das Plantas/genética , Proteínas de Plantas/antagonistas & inibidores , Proteínas de Plantas/metabolismo , Fatores de Transcrição/antagonistas & inibidores , Fatores de Transcrição/metabolismo , Transformação Genética
13.
BMC Genomics ; 13 Suppl 7: S4, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23282187

RESUMO

BACKGROUND: The opportunistic enterobacterium, Morganella morganii, which can cause bacteraemia, is the ninth most prevalent cause of clinical infections in patients at Changhua Christian Hospital, Taiwan. The KT strain of M. morganii was isolated during postoperative care of a cancer patient with a gallbladder stone who developed sepsis caused by bacteraemia. M. morganii is sometimes encountered in nosocomial settings and has been causally linked to catheter-associated bacteriuria, complex infections of the urinary and/or hepatobiliary tracts, wound infection, and septicaemia. M. morganii infection is associated with a high mortality rate, although most patients respond well to appropriate antibiotic therapy. To obtain insights into the genome biology of M. morganii and the mechanisms underlying its pathogenicity, we used Illumina technology to sequence the genome of the KT strain and compared its sequence with the genome sequences of related bacteria. RESULTS: The 3,826,919-bp sequence contained in 58 contigs has a GC content of 51.15% and includes 3,565 protein-coding sequences, 72 tRNA genes, and 10 rRNA genes. The pathogenicity-related genes encode determinants of drug resistance, fimbrial adhesins, an IgA protease, haemolysins, ureases, and insecticidal and apoptotic toxins as well as proteins found in flagellae, the iron acquisition system, a type-3 secretion system (T3SS), and several two-component systems. Comparison with 14 genome sequences from other members of Enterobacteriaceae revealed different degrees of similarity to several systems found in M. morganii. The most striking similarities were found in the IS4 family of transposases, insecticidal toxins, T3SS components, and proteins required for ethanolamine use (eut operon) and cobalamin (vitamin B12) biosynthesis. The eut operon and the gene cluster for cobalamin biosynthesis are not present in the other Proteeae genomes analysed. Moreover, organisation of the 19 genes of the eut operon differs from that found in the other non-Proteeae enterobacterial genomes. CONCLUSIONS: This is the first genome sequence of M. morganii, which is a clinically relevant pathogen. Comparative genome analysis revealed several pathogenicity-related genes and novel genes not found in the genomes of other members of Proteeae. Thus, the genome sequence of M. morganii provides important information concerning virulence and determinants of fitness in this pathogen.


Assuntos
Genoma Bacteriano , Morganella morganii/genética , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Mapeamento de Sequências Contíguas , Farmacorresistência Bacteriana , Infecções por Bactérias Gram-Negativas/microbiologia , Humanos , Morganella morganii/isolamento & purificação , Morganella morganii/patogenicidade , Proteus mirabilis/genética , Análise de Sequência de DNA
14.
J Chem Inf Model ; 52(1): 146-55, 2012 Jan 23.
Artigo em Inglês | MEDLINE | ID: mdl-22142286

RESUMO

FMS-like tyrosine kinase 3 (FLT-3) is strongly correlated with acute myeloid leukemia, but no FLT-3-inhibitor cocomplex structure is available to assist the design of therapeutic inhibitors. Hence, we propose a dual-layer 3D-QSAR model for FLT-3 that integrates the pharmacophore, CoMFA, and CoMSIA. We then coupled the model with the fragment-based design strategy to identify novel FLT-3 inhibitors. In the first layer, the previously established model, Hypo02, was evaluated in terms of its correlation coefficient (r), RMS, cost difference, and configuration cost, with values of 0.930, 1.24, 106.45, and 16.44, respectively. Moreover, Fischer's cross-validation test of data generated by Hypo02 yielded a 98% confidence level, and the validation of the testing set yielded a best r value of 0.87. The features of Hypo02 were separated into two parts and then used to screen the MiniMaybridge fragment compound database. Nine novel FLT-3 inhibitors were generated in this layer. In the second layer, Hypo02 was subjected to an alignment rule to generate CoMFA- and CoMSIA-based models, for which the partial least-squares validation method was utilized. The values of q(2), r(2), and predictive r(2) were 0.58, 0.98, and 0.76, respectively, derived from the CoMFA model with steric and electrostatic fields. The CoMSIA model with five different fields yielded values of 0.54, 0.97, and 0.76 for q(2), r(2), and predictive r(2), respectively. The CoMFA and CoMSIA models were used to constrain 3D structures of the nine novel FLT-3 inhibitors. This dual-layer 3D-QSAR model constitutes a valuable tool to easily and quickly screen and optimize novel potential FLT-3 inhibitors for the treatment of acute myeloid leukemia.


Assuntos
Antineoplásicos/química , Simulação por Computador , Inibidores de Proteínas Quinases/química , Tirosina Quinase 3 Semelhante a fms/antagonistas & inibidores , Antineoplásicos/farmacologia , Sítios de Ligação , Desenho Assistido por Computador , Desenho de Fármacos , Humanos , Interações Hidrofóbicas e Hidrofílicas , Análise dos Mínimos Quadrados , Leucemia Mieloide Aguda/tratamento farmacológico , Leucemia Mieloide Aguda/enzimologia , Modelos Moleculares , Ligação Proteica , Inibidores de Proteínas Quinases/farmacologia , Relação Quantitativa Estrutura-Atividade , Eletricidade Estática , Termodinâmica , Tirosina Quinase 3 Semelhante a fms/metabolismo
15.
Comp Funct Genomics ; 2012: 653174, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22952430

RESUMO

Genetic robustness refers to a compensatory mechanism for buffering deleterious mutations or environmental variations. Gene duplication has been shown to provide such functional backups. However, the overall contribution of duplication-based buffering for genetic robustness is rather small. In this study, we investigated whether transcriptional compensation also exists among genes that share similar functions without sequence homology. A set of nonhomologous synthetic-lethal gene pairs was assessed by using a coexpression network, protein-protein interactions, and other types of genetic interactions in yeast. Our results are notably different from those of previous studies on buffering paralogs. The low expression similarity and the conditional coexpression alone do not play roles in identifying the functionally compensatory genes. Additional properties such as synthetic-lethal interaction, the ratio of shared common interacting partners, and the degree of coregulation were, at least in part, necessary to extract functional compensatory genes. Our network-based approach is applicable to select several well-documented cases of compensatory gene pairs and a set of new pairs. The results suggest that transcriptional reprogramming plays a limited role in functional compensation among nonhomologous genes. Our study aids in understanding the mechanism and features of functional compensation more in detail.

16.
Nucleic Acids Res ; 38(Web Server issue): W221-7, 2010 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-20538651

RESUMO

SoRT(2) is a web server that allows the user to perform genome rearrangement analysis involving reversals, generalized transpositions and translocations (including fusions and fissions), and infer phylogenetic trees of genomes being considered based on their pairwise genome rearrangement distances. It takes as input two or more linear/circular multi-chromosomal gene (or synteny block) orders in FASTA-like format. When the input is two genomes, SoRT(2) will quickly calculate their rearrangement distance, as well as a corresponding optimal scenario by highlighting the genes involved in each rearrangement operation. In the case of multiple genomes, SoRT(2) will also construct phylogenetic trees of these genomes based on a matrix of their pairwise rearrangement distances using distance-based approaches, such as neighbor-joining (NJ), unweighted pair group method with arithmetic mean (UPGMA) and Fitch-Margoliash (FM) methods. In addition, if the function of computing jackknife support values is selected, SoRT(2) will further perform the jackknife analysis to evaluate statistical reliability of the constructed NJ, UPGMA and FM trees. SoRT(2) is available online at http://bioalgorithm.life.nctu.edu.tw/SORT2/.


Assuntos
Cromossomos , Evolução Molecular , Genômica/métodos , Filogenia , Software , Animais , Inversão Cromossômica , Cães , Ordem dos Genes , Genoma , Genoma Bacteriano , Genoma Mitocondrial , Humanos , Internet , Camundongos , Ratos , Sintenia , Translocação Genética
17.
ScientificWorldJournal ; 2012: 785187, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22693432

RESUMO

Taiwan red-feathered country chickens (TRFCCs) are one of the main meat resources in Taiwan. Due to the lack of any systematic breeding programs to improve egg productivity, the egg production rate of this breed has gradually decreased. The prediction by zone (PreZone) program was developed to select the chickens with low egg productivity so as to improve the egg productivity of TRFCCs before they reach maturity. Three groups (A, B, and C) of chickens were used in this study. Two approaches were used to identify chickens with low egg productivity. The first approach used predictions based on a single dataset, and the second approach used predictions based on the union of two datasets. The levels of four serum proteins, including apolipoprotein A-I, vitellogenin, X protein (an IGF-I-like protein), and apo VLDL-II, were measured in chickens that were 8, 14, 22, or 24 weeks old. Total egg numbers were recorded for each individual bird during the egg production period. PreZone analysis was performed using the four serum protein levels as selection parameters, and the results were compared to those obtained using a first-order multiple linear regression method with the same parameters. The PreZone program provides another prediction method that can be used to validate datasets with a low correlation between response and predictors. It can be used to find low and improve egg productivity in TRFCCs by selecting the best chickens before they reach maturity.


Assuntos
Proteínas Sanguíneas/análise , Galinhas/classificação , Galinhas/fisiologia , Ovos , Oviposição/fisiologia , Animais , Feminino
18.
ScientificWorldJournal ; 2012: 365104, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22778697

RESUMO

The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as ß-defensin 4 (DEFB4) and its paralog HSPDP3.


Assuntos
Algoritmos , Pareamento de Bases/genética , DNA/genética , Triagem de Portadores Genéticos/métodos , Análise de Sequência de DNA/métodos , Software , Sequência de Bases , Internet , Dados de Sequência Molecular
19.
BMC Bioinformatics ; 12: 281, 2011 Jul 12.
Artigo em Inglês | MEDLINE | ID: mdl-21749690

RESUMO

BACKGROUND: Identifying key components in biological processes and their associations is critical for deciphering cellular functions. Recently, numerous gene expression and molecular interaction experiments have been reported in Saccharomyces cerevisiae, and these have enabled systematic studies. Although a number of approaches have been used to predict gene functions and interactions, tools that analyze the essential coordination of functional components in cellular processes still need to be developed. RESULTS: In this work, we present a new approach to study the cooperation of functional modules (sets of functionally related genes) in a specific cellular process. A cooperative module pair is defined as two modules that significantly cooperate with certain functional genes in a cellular process. This method identifies cooperative module pairs that significantly influence a cellular process and the correlated genes and interactions that are essential to that process. Using the yeast cell cycle as an example, we identified 101 cooperative module associations among 82 modules, and importantly, we established a cell cycle-specific cooperative module network. Most of the identified module pairs cover cooperative pathways and components essential to the cell cycle. We found that 14, 36, 18, 15, and 20 cooperative module pairs significantly cooperate with genes regulated in early G1, late G1, S, G2, and M phase, respectively. Fifty-nine module pairs that correlate with Cdc28 and other essential regulators were also identified. These results are consistent with previous studies and demonstrate that our methodology is effective for studying cooperative mechanisms in the cell cycle. CONCLUSIONS: In this work, we propose a new approach to identifying condition-related cooperative interactions, and importantly, we establish a cell cycle-specific cooperation module network. These results provide a global view of the cell cycle and the method can be used to discover the dynamic coordination properties of functional components in other cellular processes.


Assuntos
Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/citologia , Saccharomyces cerevisiae/genética , Ciclo Celular , Perfilação da Expressão Gênica , Regulação Fúngica da Expressão Gênica , Mitose , Saccharomyces cerevisiae/fisiologia , Proteínas de Saccharomyces cerevisiae/metabolismo , Transdução de Sinais
20.
BMC Bioinformatics ; 12 Suppl 1: S17, 2011 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-21342546

RESUMO

BACKGROUND: Signal transduction is the major mechanism through which cells transmit external stimuli to evoke intracellular biochemical responses. Understanding relationship between external stimuli and corresponding cellular responses, as well as the subsequent effects on downstream genes, is a major challenge in systems biology. Thus, a systematic approach to integrate experimental data and qualitative knowledge to identify the physiological consequences of environmental stimuli is needed. RESULTS: In present study, we employed a genetic algorithm-based Boolean model to represent NF-κB signaling pathway. We were able to capture feedback and crosstalk characteristics to enhance our understanding on the acute and chronic inflammatory response. Key network components affecting the response dynamics were identified. CONCLUSIONS: We designed an effective algorithm to elucidate the process of immune response using comprehensive knowledge about network structure and limited experimental data on dynamic responses. This approach can potentially be implemented for large-scale analysis on cellular processes and organism behaviors.


Assuntos
Algoritmos , Inflamação/metabolismo , Modelos Biológicos , NF-kappa B/metabolismo , Transdução de Sinais , Inflamação/imunologia , NF-kappa B/imunologia , Receptor Cross-Talk
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA