RESUMO
The bacterium Aeromonas sp. (CGMCC 2226) can enantioselectively scavenge D-isomer, making L-amino acid derivatives (AADs) in high ee. The enantioselective scavenger (ES) has shown a broad substrate scope. Eleven L-AADs, Phe derivatives substituted with methyl-, mono- and dichloro-, bromo-, and nitro-group, were produced in high ee from corresponding racemates.
Assuntos
Aeromonas/metabolismo , Aminoácidos/síntese química , Aeromonas/crescimento & desenvolvimento , Aminoácidos/química , Modelos Moleculares , Estrutura Molecular , Rhodotorula/efeitos dos fármacos , EstereoisomerismoRESUMO
A two-layer method based on support vector machines (SVMs) has been developed to distinguish epoxide hydrolases (EHs) from other enzymes and to classify its subfamilies using its primary protein sequences. SVM classifiers were built using three different feature vectors extracted from the primary sequence of EHs: the amino acid composition (AAC), the dipeptide composition (DPC), and the pseudo-amino acid composition (PAAC). Validated by 5-fold cross tests, the first layer SVM classifier can differentiate EHs and non-EHs with an accuracy of 94.2% and has a Matthew's correlation coefficient (MCC) of 0.84. Using 2-fold cross validation, PAAC-based second layer SVM can further classify EH subfamilies with an overall accuracy of 90.7% and MCC of 0.87 as compared to AAC (80.0%) and DPC (84.9%). A program called EHPred has also been developed to assist readers to recognize EHs and to classify their subfamilies using primary protein sequences with greater accuracy.
Assuntos
Algoritmos , Inteligência Artificial , Epóxido Hidrolases/química , Epóxido Hidrolases/classificação , Reconhecimento Automatizado de Padrão/métodos , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína/métodos , Sequência de Aminoácidos , Metodologias Computacionais , Dados de Sequência Molecular , Homologia de Sequência de AminoácidosRESUMO
Knowledge of the evolution of pathogens is of great medical and biological significance to the prevention, diagnosis, and therapy of infectious diseases. In order to understand the origin and evolution of the SARS-CoV (severe acute respiratory syndrome-associated coronavirus), we collected complete genome sequences of all viruses available in GenBank, and made comparative analyses with the SARS-CoV. Genomic signature analysis demonstrates that the coronaviruses all take the TGTT as their richest tetranucleotide except the SARS-CoV. A detailed analysis of the forty-two complete SARS-CoV genome sequences revealed the existence of two distinct genotypes, and showed that these isolates could be classified into four groups. Our manual analysis of the BLASTN results demonstrates that the HE (hemagglutinin-esterase) gene exists in the SARS-CoV, and many mutations made it unfamiliar to us.
Assuntos
Evolução Molecular , Variação Genética , Genoma Viral , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/genética , Motivos de Aminoácidos , Substituição de Aminoácidos , Composição de Bases , Códon/genética , Biologia Computacional , Análise Mutacional de DNA , Transferência Genética Horizontal , FilogeniaRESUMO
Annotation of the genome sequence of the SARS-CoV (severe acute respiratory syndrome-associated coronavirus) is indispensable to understand its evolution and pathogenesis. We have performed a full annotation of the SARS-CoV genome sequences by using annotation programs publicly available or developed by ourselves. Totally, 21 open reading frames (ORFs) of genes or putative uncharacterized proteins (PUPs) were predicted. Seven PUPs had not been reported previously, and two of them were predicted to contain transmembrane regions. Eight ORFs partially overlapped with or embedded into those of known genes, revealing that the SARS-CoV genome is a small and compact one with overlapped coding regions. The most striking discovery is that an ORF locates on the minus strand. We have also annotated non-coding regions and identified the transcription regulating sequences (TRS) in the intergenic regions. The analysis of TRS supports the minus strand extending transcription mechanism of coronavirus. The SNP analysis of different isolates reveals that mutations of the sequences do not affect the prediction results of ORFs.
Assuntos
Genoma Viral , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/genética , Substituição de Aminoácidos , Composição de Bases , Sequência de Bases , Biologia Computacional/métodos , Ponto Isoelétrico , Modelos Genéticos , Dados de Sequência Molecular , Peso Molecular , Fases de Leitura Aberta , Análise de Sequência , Transcrição GênicaRESUMO
The R (replicase) protein is the uniquely defined non-structural protein (NSP) responsible for RNA replication, mutation rate or fidelity, regulation of transcription in coronaviruses and many other ssRNA viruses. Based on our complete genome sequences of four isolates (BJ01-BJ04) of SARS-CoV from Beijing, China, we analyzed the structure and predicted functions of the R protein in comparison with 13 other isolates of SARS-CoV and 6 other coronaviruses. The entire ORF (open-reading frame) encodes for two major enzyme activities, RNA-dependent RNA polymerase (RdRp) and proteinase activities. The R polyprotein undergoes a complex proteolytic process to produce 15 function-related peptides. A hydrophobic domain (HOD) and a hydrophilic domain (HID) are newly identified within NSP1. The substitution rate of the R protein is close to the average of the SARS-CoV genome. The functional domains in all NSPs of the R protein give different phylogenetic results that suggest their different mutation rate under selective pressure. Eleven highly conserved regions in RdRp and twelve cleavage sites by 3CLP (chymotrypsin-like protein) have been identified as potential drug targets. Findings suggest that it is possible to obtain information about the phylogeny of SARS-CoV, as well as potential tools for drug design, genotyping and diagnostics of SARS.
Assuntos
Genoma Viral , Mutação/genética , Filogenia , RNA Polimerase Dependente de RNA/genética , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/genética , Sequência de Aminoácidos , Composição de Bases , Sequência de Bases , Análise por Conglomerados , Biologia Computacional , Sequência Conservada/genética , Evolução Molecular , Componentes do Gene , Dados de Sequência Molecular , Estrutura Terciária de Proteína , Análise de Sequência de DNARESUMO
Beijing has been one of the epicenters attacked most severely by the SARS-CoV (severe acute respiratory syndrome-associated coronavirus) since the first patient was diagnosed in one of the city's hospitals. We now report complete genome sequences of the BJ Group, including four isolates (Isolates BJ01, BJ02, BJ03, and BJ04) of the SARS-CoV. It is remarkable that all members of the BJ Group share a common haplotype, consisting of seven loci that differentiate the group from other isolates published to date. Among 42 substitutions uniquely identified from the BJ group, 32 are non-synonymous changes at the amino acid level. Rooted phylogenetic trees, proposed on the basis of haplotypes and other sequence variations of SARS-CoV isolates from Canada, USA, Singapore, and China, gave rise to different paradigms but positioned the BJ Group, together with the newly discovered GD01 (GD-Ins29) in the same clade, followed by the H-U Group (from Hong Kong to USA) and the H-T Group (from Hong Kong to Toronto), leaving the SP Group (Singapore) more distant. This result appears to suggest a possible transmission path from Guangdong to Beijing/Hong Kong, then to other countries and regions.
Assuntos
Genoma Viral , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave/genética , Haplótipos , Humanos , Mutação , Fases de Leitura Aberta , FilogeniaRESUMO
The application of ionic liquids based microwave-assisted extraction (ILMAE) was successfully developed for extracting three alkaloids N-nornuciferine, O-nornuciferine, and nuciferine from lotus leaf. Seven kinds of 1-alkyl-3-methylimidazolium with different cations and anions were investigated in this work and 1.0M 1-hexyl-3-methylimidazolium bromide ([C(6)MIM]Br) solution was selected as solvent. In addition, the microwave parameters including irradiation power, extraction time and solid-liquid ratio were optimized. Compared with the regular MAE and conventional heat-reflux extraction (HRE), the proposed approach exhibited higher efficiency (0.9-43.7% enhanced) and shorter extraction time (from 2h to 2min), which indicated ILMAE was an efficient, rapid and simple sample preparation technique. Moreover, the proposed method was validated by the linearity, reproducibility, and recovery experiments. Good linearity was observed with the regression coefficients (r(2)) between 0.9998 and 0.9999. The recoveries of all methods were in the range of 94.6% and 105.5% with RSD lower than 6.6%, which indicated that the proposed method was credible.