Search | BVS Regional Portal

1.

The sequence and de novo assembly of the genome of the Indian oil sardine, Sardinella longiceps.

Sukumaran, Sandhya; Sebastian, Wilson; Gopalakrishnan, A; Mathew, Oommen K; Vysakh, V G; Rohit, Prathibha; Jena, J K.

Sci Data ; 10(1): 565, 2023 08 25.

Article in English | MEDLINE | ID: mdl-37626109

ABSTRACT

The Indian oil sardine, Sardinella longiceps, is a widely distributed and commercially important small pelagic fish of the Northern Indian Ocean. The genome of the Indian oil sardine has been characterized using Illumina and Nanopore platforms. The assembly is 1.077 Gb in size (scaffold N50 of 31.86 Mb), with a repeat content of 23.24%. The BUSCO (Benchmarking Universal Single Copy Orthologues) completeness of the assembly is 93.5% against the Actinopterygii (ray-finned fishes) data set. A total of 46,316 protein-coding genes were predicted. Sardinella longiceps is nutritionally rich, with high levels of omega-3 polyunsaturated fatty acids (PUFA). The core genes for omega-3 PUFA biosynthesis, such as Elovl 1a and 1b, Elovl 2, Elovl 4a and 4b, Elovl 8a and 8b, and Fads 2, were observed in Sardinella longiceps. The presence of these genes may indicate the PUFA biosynthetic capability of the Indian oil sardine, which needs to be confirmed functionally.
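The scaffold N50 quoted in this abstract is a standard assembly statistic: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the assembly. The sketch below shows how such a value can be computed; the function name `n50` and the toy lengths are illustrative assumptions and are not taken from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp)."""
    if not lengths:
        raise ValueError("at least one length is required")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence downwards until half the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy scaffold lengths (hypothetical, not the sardine assembly).
toy_scaffolds = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(toy_scaffolds))  # 8
```

Integer arithmetic (`2 * running >= total`) is used so that the half-way threshold is not affected by floating-point rounding.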

Subject(s)

Fishes , Genome , Animals , Benchmarking , Fishes/genetics , Indian Ocean

2.

Near-chromosomal de novo assembly of Bengal tiger genome reveals genetic hallmarks of apex predation.

Shukla, Harsh; Suryamohan, Kushal; Khan, Anubhab; Mohan, Krishna; Perumal, Rajadurai C; Mathew, Oommen K; Menon, Ramesh; Dixon, Mandumpala Davis; Muraleedharan, Megha; Kuriakose, Boney; Michael, Saju; Krishnankutty, Sajesh P; Zachariah, Arun; Seshagiri, Somasekar; Ramakrishnan, Uma.

Gigascience ; 12, 2022 12 28.

Article in English | MEDLINE | ID: mdl-36576130

ABSTRACT

The tiger, a poster child for conservation, remains an endangered apex predator. Continued survival and recovery will require a comprehensive understanding of genetic diversity and the use of such information for population management. A high-quality tiger genome assembly will be an important tool for conservation genetics, especially for the Indian tiger, the most abundant subspecies in the wild. Here, we present high-quality near-chromosomal genome assemblies of a female and a male wild Indian tiger (Panthera tigris tigris). Our assemblies had a scaffold N50 of >140 Mb, with 19 scaffolds corresponding to the 19 numbered chromosomes, containing 95% of the genome. Our assemblies also enabled detection of longer stretches of runs of homozygosity compared to previous assemblies, which will help improve estimates of genomic inbreeding. Comprehensive genome annotation identified 26,068 protein-coding genes, including several gene families involved in key morphological features such as the teeth, claws, vision, olfaction, taste, and body stripes. We also identified 301 microRNAs, 365 small nucleolar RNAs, 632 transfer RNAs, and other noncoding RNA elements, several of which are predicted to regulate key biological pathways that likely contribute to the tiger's apex predatory traits. We identify signatures of positive selection in the tiger genome that are consistent with the Panthera lineage. Our high-quality genome will enable use of noninvasive samples for comprehensive assessment of genetic diversity, thus supporting effective conservation and management of wild tiger populations.

Subject(s)

Predatory Behavior , Tigers , Animals , Female , Male , Chromosomes , Genome , Genomics , Tigers/genetics

3.

PASS2.7: a database containing structure-based sequence alignments and associated features of protein domain superfamilies from SCOPe.

Bhattacharyya, Teerna; Nayak, Soumya; Goswami, Smit; Gadiyaram, Vasundhara; Mathew, Oommen K; Sowdhamini, Ramanathan.

Database (Oxford) ; 2022, 2022 04 12.

Article in English | MEDLINE | ID: mdl-35411388

ABSTRACT

Sequence alignments are models that capture the structural, functional and evolutionary relationships between proteins. Structure-guided sequence alignments are helpful for distantly related proteins whose poor sequence identity renders routine sequence alignment methods ineffective. The Protein Alignment organized as Structural Superfamilies (PASS2) database provides such sequence alignments of protein domains within a superfamily as per the Structural Classification of Proteins extended (SCOPe) database. The current update of PASS2 (i.e. PASS2.7) follows the latest release of SCOPe (2.07), and we provide data for 14 323 protein domains that are <40% identical and are organized into 2024 superfamilies. Several useful features derived from the alignments, such as conserved secondary structural motifs, HMMs and residues conserved across the superfamily, are also reported. Protein domains that are deviant from the rest of the members of a superfamily may compromise the quality of the alignment, and we found this to be the case in ~7% of the total superfamilies we considered. To improve the alignment by objectively identifying such 'outliers', in this update, we have used a k-means-based unsupervised machine learning method for clustering superfamily members, where the features provided were the length of the domains aligned, the Cα-RMSD derived from the rigid-body superposition of all members and the gaps contributed to the alignment by each domain. In a few cases, we have split the superfamily as per the clusters predicted and provided complete data for each cluster. A new feature included in this update is absolutely conserved interactions (ACIs) between residue backbones and side chains, which are obtained by aligning protein structure networks using structure-guided sequence alignments of superfamilies. ACIs provide valuable information about functionally important residues and the structure-function relationships of proteins. The ACIs and the corresponding conserved networks for backbone and sidechain have been marked on the superimposed structure separately. DATABASE URL: The updated version of the PASS2 database is available at http://caps.ncbs.res.in/pass2/.
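The abstract names the three features used for k-means clustering (domain length, Cα-RMSD and gaps contributed) but gives no implementation details. The following is only a minimal sketch of the general idea, assuming z-score scaling of the features, k = 2 and a deterministic farthest-point initialization; the feature values, function names and these choices are hypothetical and are not the PASS2.7 pipeline.

```python
import math


def standardize(rows):
    """Z-score each feature column so that no single unit dominates."""
    columns = list(zip(*rows))
    means = [sum(col) / len(col) for col in columns]
    stds = [math.sqrt(sum((x - m) ** 2 for x in col) / len(col)) or 1.0
            for col, m in zip(columns, means)]
    return [[(x - m) / s for x, m, s in zip(row, means, stds)] for row in rows]


def farthest_point_init(points, k):
    """Deterministic seeding: start at the first point, then repeatedly
    pick the point farthest from all centroids chosen so far."""
    centroids = [list(points[0])]
    while len(centroids) < k:
        nxt = max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        centroids.append(list(nxt))
    return centroids


def kmeans(points, k, max_iter=100):
    """Plain Lloyd-style k-means; returns one cluster label per point."""
    centroids = farthest_point_init(points, k)
    labels = None
    for _ in range(max_iter):
        new_labels = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                      for p in points]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(v) / len(members) for v in zip(*members)]
    return labels


# Hypothetical (length, C-alpha RMSD in angstroms, gaps) for five domains.
domains = [(120, 1.2, 4), (118, 1.4, 5), (125, 1.1, 3), (122, 1.3, 6),
           (210, 4.8, 40)]
labels = kmeans(standardize(domains), k=2)
print(labels)  # [0, 0, 0, 0, 1]
```

In this toy input the fifth domain is longer, superposes poorly and contributes many gaps, so it ends up alone in its own cluster, which is the kind of member the abstract describes as an outlier.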

Subject(s)

Proteins , Protein Databases , Protein Domains , Tertiary Protein Structure , Proteins/chemistry , Proteins/genetics , Sequence Alignment

4.

Phenotypic expression and clinical outcomes in a South Asian PRKAG2 cardiomyopathy cohort.

Ahamed, Hisham; Balegadde, Aniketh Vijay; Menon, Shilpa; Menon, Ramesh; Ramachandran, Aishwarya; Mathew, Navin; Natarajan, K U; Nair, Indu Ramachandran; Kannan, Rajesh; Shankar, Meghna; Mathew, Oommen K; Nguyen, Thong T; Gupta, Ravi; Stawiski, Eric W; Ramprasad, V L; Seshagiri, Somasekar; Phalke, Sameer.

Sci Rep ; 10(1): 20610, 2020 11 26.

Article in English | MEDLINE | ID: mdl-33244021

ABSTRACT

The PRKAG2 syndrome is a rare autosomal dominant phenocopy of sarcomeric hypertrophic cardiomyopathy (HCM), characterized by ventricular pre-excitation, progressive conduction system disease and left ventricular hypertrophy. This study describes the phenotype, genotype and clinical outcomes of a South-Asian PRKAG2 cardiomyopathy cohort over a 7-year period. Clinical, electrocardiographic, echocardiographic, and cardiac MRI data from 22 individuals with PRKAG2 variants (68% men; mean age 39.5 ± 18.1 years), identified at our HCM centre, were studied prospectively. At initial evaluation, all of the patients were in NYHA functional class I or II. The maximum left ventricular wall thickness was 22.9 ± 8.7 mm and left ventricular ejection fraction was 53.4 ± 6.6%. Left ventricular hypertrophy was present in 19 individuals (86%) at baseline. Seventeen patients (77%) had a WPW pattern. After a mean follow-up period of 7 years, 2 patients had undergone accessory pathway ablation, 8 patients (36%) underwent permanent pacemaker implantation (atrio-ventricular blocks-5; sinus node disease-2), 3 patients developed atrial fibrillation, 11 patients (50%) developed progressive worsening in NYHA functional class, and 6 patients (27%) experienced sudden cardiac death or equivalent. PRKAG2 cardiomyopathy must be considered in patients with HCM and progressive conduction system disease.

Subject(s)

AMP-Activated Protein Kinases/genetics , Asian People/genetics , Cardiomyopathies/genetics , Adolescent , Adult , Atrial Fibrillation/genetics , Child , Cohort Studies , Sudden Cardiac Death , Echocardiography/methods , Electrocardiography/methods , Female , Genetic Variation/genetics , Humans , Left Ventricular Hypertrophy/genetics , Male , Middle Aged , Pedigree , Phenotype , Left Ventricular Function/genetics , Young Adult

5.

A knowledge-driven protocol for prediction of proteins of interest with an emphasis on biosynthetic pathways.

Joshi, Adwait G; Harini, K; Meenakshi, Iyer; Shafi, K Mohamed; Pasha, Shaik Naseer; Mahita, Jarjapu; Sajeevan, Radha Sivarajan; Karpe, Snehal D; Ghosh, Pritha; Nitish, Sathyanarayanan; Gandhimathi, A; Mathew, Oommen K; Prasanna, Subramanian Hari; Malini, Manoharan; Mutt, Eshita; Naika, Mahantesha; Ravooru, Nithin; Rao, Rajas M; Shingate, Prashant N; Sukhwal, Anshul; Sunitha, Margaret S; Upadhyay, Atul K; Vinekar, Rithvik S; Sowdhamini, Ramanathan.

MethodsX ; 7: 101053, 2020.

Article in English | MEDLINE | ID: mdl-33024710

ABSTRACT

This protocol describes a stepwise process to identify proteins of interest from a query proteome derived from NGS data. We implemented this protocol on the Moringa oleifera transcriptome to identify proteins involved in secondary metabolite and vitamin biosynthesis and ion transport. This knowledge-driven protocol identifies proteins using an integrated approach involving sensitive sequence search and evolutionary relationships. We make use of functionally important residues (FIR) specific for the query protein family, identified through its homologous sequences and literature. We screen protein hits based on clustering with true homologues through phylogenetic tree reconstruction, complemented with FIR mapping. The protocol was validated for the protein hits through qRT-PCR and transcriptome quantification. Our protocol demonstrated a higher specificity as compared to other methods, particularly in distinguishing cross-family hits. This protocol was effective in transcriptome data analysis of M. oleifera as described in Pasha et al.
• Knowledge-driven protocol to identify secondary metabolite synthesizing proteins in a highly specific manner.
• Use of functionally important residues for screening of true hits.
• Beneficial for metabolite pathway reconstruction in any (species, metagenomics) NGS data.

6.

Dataset for the combined transcriptome assembly of M. oleifera and functional annotation.

Shafi, K Mohamed; Joshi, Adwait G; Meenakshi, Iyer; Pasha, Shaik Naseer; Harini, K; Mahita, Jarjapu; Sajeevan, Radha Sivarajan; Karpe, Snehal D; Ghosh, Pritha; Nitish, Sathyanarayanan; Gandhimathi, A; Mathew, Oommen K; Prasanna, Subramanian Hari; Malini, Manoharan; Mutt, Eshita; Naika, Mahantesha; Ravooru, Nithin; Rao, Rajas M; Shingate, Prashant N; Sukhwal, Anshul; Sunitha, Margaret S; Upadhyay, Atul K; Vinekar, Rithvik S; Sowdhamini, Ramanathan.

Data Brief ; 30: 105416, 2020 Jun.

Article in English | MEDLINE | ID: mdl-32309524

ABSTRACT

In this paper, we present the data acquired during transcriptome analysis of the plant Moringa oleifera [1] from five different tissues (root, stem, leaf, flower and seed) by RNA sequencing. A total of 271 million reads were assembled with an N50 of 2094 bp. The combined transcriptome was assessed for transcript abundance across five tissues. The protein-coding genes identified from the transcripts were annotated and used for orthology analysis. Further, enzymes involved in the biosynthesis of select medicinally important secondary metabolites, vitamins and ion transporters were identified and their expression levels across tissues were examined. The data generated by RNA sequencing have been deposited in the NCBI public repository under the accession number PRJNA394193 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA394193).

7.

The Indian cobra reference genome and transcriptome enables comprehensive identification of venom toxins.

Suryamohan, Kushal; Krishnankutty, Sajesh P; Guillory, Joseph; Jevit, Matthew; Schröder, Markus S; Wu, Meng; Kuriakose, Boney; Mathew, Oommen K; Perumal, Rajadurai C; Koludarov, Ivan; Goldstein, Leonard D; Senger, Kate; Dixon, Mandumpala Davis; Velayutham, Dinesh; Vargas, Derek; Chaudhuri, Subhra; Muraleedharan, Megha; Goel, Ridhi; Chen, Ying-Jiun J; Ratan, Aakrosh; Liu, Peter; Faherty, Brendan; de la Rosa, Guillermo; Shibata, Hiroki; Baca, Miriam; Sagolla, Meredith; Ziai, James; Wright, Gus A; Vucic, Domagoj; Mohan, Sangeetha; Antony, Aju; Stinson, Jeremy; Kirkpatrick, Donald S; Hannoush, Rami N; Durinck, Steffen; Modrusan, Zora; Stawiski, Eric W; Wiley, Kristen; Raudsepp, Terje; Kini, R Manjunatha; Zachariah, Arun; Seshagiri, Somasekar.

Nat Genet ; 52(1): 106-117, 2020 01.

Article in English | MEDLINE | ID: mdl-31907489

ABSTRACT

Snakebite envenoming is a serious and neglected tropical disease that kills ~100,000 people annually. High-quality, genome-enabled comprehensive characterization of toxin genes will facilitate development of effective humanized recombinant antivenom. We report a de novo near-chromosomal genome assembly of Naja naja, the Indian cobra, a highly venomous, medically important snake. Our assembly has a scaffold N50 of 223.35 Mb, with 19 scaffolds containing 95% of the genome. Of the 23,248 predicted protein-coding genes, 12,346 venom-gland-expressed genes constitute the 'venom-ome' and this included 139 genes from 33 toxin families. Among the 139 toxin genes were 19 'venom-ome-specific toxins' (VSTs) that showed venom-gland-specific expression, and these probably encode the minimal core venom effector proteins. Synthetic venom reconstituted through recombinant VST expression will aid in the rapid development of safe and effective synthetic antivenom. Additionally, our genome could serve as a reference for snake genomes, support evolutionary studies and enable venom-driven drug discovery.

Subject(s)

Computational Biology/methods , Elapid Venoms/analysis , Elapid Venoms/genetics , Genome , Naja naja/genetics , Transcriptome , Amino Acid Sequence , Animals , Gene Expression Profiling , India , Sequence Homology

8.

The transcriptome enables the identification of candidate genes behind medicinal value of Drumstick tree (Moringa oleifera).

Pasha, Shaik Naseer; Shafi, K Mohamed; Joshi, Adwait G; Meenakshi, Iyer; Harini, K; Mahita, Jarjapu; Sajeevan, Radha Sivarajan; Karpe, Snehal D; Ghosh, Pritha; Nitish, Sathyanarayanan; Gandhimathi, A; Mathew, Oommen K; Prasanna, Subramanian Hari; Malini, Manoharan; Mutt, Eshita; Naika, Mahantesha; Ravooru, Nithin; Rao, Rajas M; Shingate, Prashant N; Sukhwal, Anshul; Sunitha, Margaret S; Upadhyay, Atul K; Vinekar, Rithvik S; Sowdhamini, Ramanathan.

Genomics ; 112(1): 621-628, 2020 01.

Article in English | MEDLINE | ID: mdl-31048014

ABSTRACT

Moringa oleifera is a plant well known for its nutritional value, drought resistance and medicinal properties. cDNA libraries from five different tissues (leaf, root, stem, seed and flower) of M. oleifera cultivar Bhagya were generated and sequenced. We developed a bioinformatics pipeline to assemble the transcriptome, along with the previously published M. oleifera genome, to predict 17,148 gene models. A few candidate genes related to the biosynthesis of secondary metabolites, vitamins and ion transporters were identified. Expression was further confirmed by real-time quantitative PCR experiments for a few promising leads. Quantitative estimation of metabolites, as well as elemental analysis, was also carried out to support our observations. Enzymes in the biosynthesis of vitamins and metabolites like quercetin and kaempferol are highly expressed in leaves, flowers and seeds. The expression of iron transporters and calcium storage proteins was observed in root and leaves. In general, leaves retain the highest amount of small molecules of interest.

Subject(s)

Gene Expression Profiling , Plant Gene Expression Regulation/physiology , Moringa oleifera , Secondary Metabolism/physiology , Transcriptome/physiology , Gene Library , Moringa oleifera/genetics , Moringa oleifera/metabolism

9.

Sepsis in the era of data-driven medicine: personalizing risks, diagnoses, treatments and prognoses.

Liu, Andrew C; Patel, Krishna; Vunikili, Ramya Dhatri; Johnson, Kipp W; Abdu, Fahad; Belman, Shivani Kamath; Glicksberg, Benjamin S; Tandale, Pratyush; Fontanez, Roberto; Mathew, Oommen K; Kasarskis, Andrew; Mukherjee, Priyabrata; Subramanian, Lakshminarayanan; Dudley, Joel T; Shameer, Khader.

Brief Bioinform ; 21(4): 1182-1195, 2020 07 15.

Article in English | MEDLINE | ID: mdl-31190075

ABSTRACT

Sepsis is a series of clinical syndromes caused by the immunological response to infection. Sepsis is typically attributed to bacterial infection or bacterial endotoxins, but infections due to viruses, fungi or parasites can also lead to sepsis. Regardless of the etiology, rapid clinical deterioration, prolonged stay in intensive care units and high risk for mortality correlate with the incidence of sepsis. Despite its prevalence and morbidity, improvement in sepsis outcomes has remained limited. In this comprehensive review, we summarize the current landscape of risk estimation, diagnosis, treatment and prognosis strategies in the setting of sepsis and discuss future challenges. We argue that the advent of modern technologies such as in-depth molecular profiling, biomedical big data and machine intelligence methods will augment the treatment and prevention of sepsis. The volume, variety, veracity and velocity of heterogeneous data generated as part of healthcare delivery and recent advances in biotechnology-driven therapeutics and companion diagnostics may provide a new wave of approaches to identify the most at-risk sepsis patients and reduce the symptom burden in patients within shorter turnaround times. Developing novel therapies by leveraging modern drug discovery strategies including computational drug repositioning, cell and gene therapy, clustered regularly interspaced short palindromic repeats (CRISPR)-based genetic editing systems, immunotherapy, microbiome restoration, nanomaterial-based therapy and phage therapy may help to develop treatments to target sepsis. We also provide empirical evidence for potential new sepsis targets including FER and STARD3NL. Implementing data-driven methods that use real-time collection and analysis of clinical variables to trace, track and treat sepsis-related adverse outcomes will be key. Understanding the root and route of sepsis and its comorbid conditions that complicate treatment outcomes and lead to organ dysfunction may help to facilitate identification of the most at-risk patients and prevent further deterioration. To conclude, leveraging the advances in precision medicine, biomedical data science and translational bioinformatics approaches may help to develop better strategies to diagnose and treat sepsis in the next decade.

Subject(s)

Precision Medicine , Sepsis/diagnosis , Sepsis/therapy , Humans , Prognosis , Risk Factors , Sepsis/pathology

10.

Draft Genome Sequence of Freshwater-Derived Streptomyces sp. Strain BPSDS2, Isolated from Damte Stream, Northeast India.

Passari, Ajit Kumar; Deka, Purbajyoti; Rajput, Vinay; Priya, Lakshmi P M; Dharne, Mahesh; Dastager, Syed; Mathew, Oommen K; Hashem, Abeer; Abd Allah, Elsayed Fathi; Singh, Bhim Pratap.

Microbiol Resour Announc ; 8(43), 2019 Oct 24.

Article in English | MEDLINE | ID: mdl-31649095

ABSTRACT

We report the draft genome sequence of Streptomyces sp. strain BPSDS2, isolated from freshwater sediments in Northeast India. The draft genome has a size of 8.27 Mb and 7,559 protein-coding sequences.

11.

Draft Genome Sequence of Streptomyces thermocarboxydus BPSAC147, a Potentially Plant Growth-Promoting Endophytic Bacterium.

Passari, Ajit Kumar; Rajput, Vinay; Priya, Lakshmi P M; Dharne, Mahesh; Dastager, Syed; Mathew, Oommen K; Hashem, Abeer; Abd Allah, Elsayed Fathi; Singh, Bhim Pratap.

Microbiol Resour Announc ; 8(23), 2019 Jun 06.

Article in English | MEDLINE | ID: mdl-31171621

ABSTRACT

Streptomyces thermocarboxydus strain BPSAC147 is an endophytic actinobacterium isolated from root tissues of Rhynchotechum ellipticum in Mizoram, Northeast India. The strain showed potentially plant growth-promoting and biocontrol activities. The draft genome consists of 7.37 Mb and encodes 7,060 putative open reading frames.

12.

Draft Genome Sequence of Plant Growth-Promoting Endophytic Microbacterium hydrothermale BPSAC84, Isolated from the Medicinal Plant Mirabilis jalapa.

Passari, Ajit Kumar; Rajput, Vinay; Priya, Lakshmi P M; Dharne, Mahesh; Dastager, Syed; Mathew, Oommen K; Hashem, Abeer; Abd Allah, Elsayed Fathi; Singh, Bhim Pratap.

Microbiol Resour Announc ; 8(22), 2019 May 30.

Article in English | MEDLINE | ID: mdl-31147433

ABSTRACT

Endophytic Microbacterium hydrothermale strain BPSAC84, which has antimicrobial potential, was isolated from root tissues of Mirabilis jalapa in Mizoram, Northeast India. The draft genome consists of 3.58 Mb and 3,444 protein-coding sequences.

13.

PASS2 version 6: a database of structure-based sequence alignments of protein domain superfamilies in accordance with SCOPe.

Ghosh, Pritha; Bhattacharyya, Teerna; Mathew, Oommen K; Sowdhamini, Ramanathan.

Database (Oxford) ; 2019, 2019 01 01.

Article in English | MEDLINE | ID: mdl-30820573

ABSTRACT

The number of protein structures is increasing due to individual initiatives and the rapid development of structure determination techniques. Structure-based sequence alignments of distantly related proteins enable the investigation of structural, evolutionary and functional relationships between proteins and their domains, leading to their common evolutionary origin. Protein Alignments organized as Structural Superfamilies (PASS2) is a database that provides such alignments of members of protein domain superfamilies of known structure and with less than 40% sequence identity. PASS2 has been continuously updated in accordance with Structural Classification of Proteins (SCOP), and now Structural Classification of Proteins - extended (SCOPe). The current update directly corresponds to SCOPe 2.06, dealing with 2006 domain superfamilies of known structure and about 14 000 domains. Alignments have been augmented by features such as hidden Markov models, highly conserved residues, structural motifs and gene ontology terms, which are available for download. In this update, we introduce the concepts of 'extreme structural outliers' and 'split superfamilies' as well.

Subject(s)

Protein Databases , Proteins/chemistry , Proteins/classification , Sequence Alignment , Amino Acids/genetics , Conserved Sequence , Phylogeny , Tertiary Protein Structure

14.

RStrucFam: a web server to associate structure and cognate RNA for RNA-binding proteins from sequence information.

Ghosh, Pritha; Mathew, Oommen K; Sowdhamini, Ramanathan.

BMC Bioinformatics ; 17(1): 411, 2016 Oct 07.

Article in English | MEDLINE | ID: mdl-27717309

ABSTRACT

BACKGROUND: RNA-binding proteins (RBPs) interact with their cognate RNA(s) to form large biomolecular assemblies. They are versatile in their functionality and are involved in a myriad of processes inside the cell. RBPs with similar structural features and common biological functions are grouped together into families and superfamilies. It will be useful to obtain an early understanding and association of the RNA-binding property of sequences of gene products. Here, we report a web server, RStrucFam, to predict the structure, type of cognate RNA(s) and function(s) of proteins, where possible, from mere sequence information. RESULTS: The web server employs Hidden Markov Model scan (hmmscan) to enable association to a back-end database of structural and sequence families. The database (HMMRBP) comprises 437 HMMs of RBP families of known structure that have been generated using structure-based sequence alignments and 746 sequence-centric RBP family HMMs. The input protein sequence is associated with structural or sequence domain families, if structure or sequence signatures exist. When the protein is associated with a family of known structure, output features such as a multiple structure-based sequence alignment (MSSA) of the query with all other members of that family are provided. Further, cognate RNA partner(s) for that protein, Gene Ontology (GO) annotations, if any, and a homology model of the protein can be obtained. Users can also browse the database for details pertaining to each family, protein or RNA and their related information based on keyword search or RNA motif search. CONCLUSIONS: RStrucFam is a web server that exploits structurally conserved features of RBPs, derived from known family members and imprinted in mathematical profiles, to predict putative RBPs from sequence information. Proteins that fail to associate with such structure-centric families are further queried against the sequence-centric RBP family HMMs in the HMMRBP database. Further, all other essential information pertaining to an RBP, like overall function annotations, is provided. The web server can be accessed at the following link: http://caps.ncbs.res.in/rstrucfam .

Subject(s)

RNA-Binding Proteins/chemistry , RNA-Binding Proteins/genetics , RNA/chemistry , RNA/genetics , Amino Acid Sequence , Humans , Tertiary Protein Structure

15.

PIMADb: A Database of Protein-Protein Interactions in Huge Macromolecular Assemblies.

Mathew, Oommen K; Sowdhamini, Ramanathan.

Bioinform Biol Insights ; 10: 105-9, 2016.

Article in English | MEDLINE | ID: mdl-27478368

ABSTRACT

Protein-protein interactions play a very important role in cellular function. Intricate details about the interactions between the proteins in a macromolecular assembly are important to understand the function and significance of protein complexes. We report a database of protein-protein interactions in huge macromolecular assemblies (PIMADb) that records the intrinsic details of 189,532 interchain interactions in 40,049 complexes from the Protein Data Bank. These details include the results of the quantification and analysis of all the interactions in the complex. The availability of interprotomer interaction networks can enable the design of point mutation experiments. PIMADb can be accessed from the URL: http://caps.ncbs.res.in/pimadb.

16.

An Approach to Function Annotation for Proteins of Unknown Function (PUFs) in the Transcriptome of Indian Mulberry.

Dhanyalakshmi, K H; Naika, Mahantesha B N; Sajeevan, R S; Mathew, Oommen K; Shafi, K Mohamed; Sowdhamini, Ramanathan; N Nataraja, Karaba.

PLoS One ; 11(3): e0151323, 2016.

Article in English | MEDLINE | ID: mdl-26982336

ABSTRACT

Modern sequencing technologies are generating large volumes of information at the transcriptome and genome level. Translation of this information into biological meaning lags far behind, so a significant portion of the proteins discovered remain proteins of unknown function (PUFs). Attempts to uncover the functional significance of PUFs are limited by the lack of easy, high-throughput functional annotation tools. Here, we report an approach to assign putative functions to PUFs identified in the transcriptome of mulberry, a perennial tree commonly cultivated as a host of the silkworm. We utilized the mulberry PUFs generated from leaf tissues exposed to drought stress at the whole-plant level. A sequence- and structure-based computational analysis predicted the probable function of the PUFs. For rapid and easy annotation of PUFs, we developed an automated pipeline by integrating diverse bioinformatics tools, designated as PUFs Annotation Server (PUFAS), which also provides a web service API (Application Programming Interface) for large-scale analysis up to a genome. The expression analysis of three selected PUFs annotated by the pipeline revealed abiotic stress responsiveness of the genes, and hence their potential role in stress acclimation pathways. The automated pipeline developed here could be extended to assign functions to PUFs from any organism in general. The PUFAS web server is available at http://caps.ncbs.res.in/pufas/ and the web service is accessible at http://capservices.ncbs.res.in/help/pufas.

Subject(s)

Morus/genetics , Plant Proteins/genetics , Transcriptome , Amino Acid Sequence , Molecular Sequence Data , Morus/classification , Phylogeny , Plant Proteins/chemistry , Plant Proteins/metabolism , Protein Binding , Amino Acid Sequence Homology

17.

PASS2 database for the structure-based sequence alignment of distantly related SCOP domain superfamilies: update to version 5 and added features.

Gandhimathi, Arumugam; Ghosh, Pritha; Hariharaputran, Sridhar; Mathew, Oommen K; Sowdhamini, R.

Nucleic Acids Res ; 44(D1): D410-4, 2016 Jan 04.

Article in English | MEDLINE | ID: mdl-26553811

ABSTRACT

Structure-based sequence alignment is an essential step in assessing and analysing the relationship of distantly related proteins. PASS2 is a database that records such alignments for protein domain superfamilies and has been updated periodically. This update of the PASS2 version, named PASS2.5, directly corresponds to the SCOPe 2.04 release. All SCOPe structural domains that share less than 40% sequence identity, as defined by the ASTRAL compendium of protein structures, are included. The current version includes 1977 superfamilies and has been assembled utilizing the structure-based sequence alignment protocol. Such an alignment is obtained initially through MATT, followed by a refinement through the COMPARER program. The JOY program has been used for structural annotations of such alignments. In this update, we have automated the protocol and focused on the inclusion of new features such as mapping of GO terms, absolutely conserved residues among the domains in a superfamily and inclusion of PDB entries that are absent in SCOPe 2.04, identified using the HMM profiles from the alignments of the superfamily members and provided as a separate list. We have also implemented a more user-friendly manner of data presentation and options for downloading more features. The PASS2.5 version is available at http://caps.ncbs.res.in/pass2/.

Subject(s)

Protein Databases , Tertiary Protein Structure , Sequence Alignment , Gene Ontology , Proteins/chemistry , Proteins/classification , Proteins/genetics

18.

Genome sequencing of herb Tulsi (Ocimum tenuiflorum) unravels key genes behind its strong medicinal properties.

Upadhyay, Atul K; Chacko, Anita R; Gandhimathi, A; Ghosh, Pritha; Harini, K; Joseph, Agnel P; Joshi, Adwait G; Karpe, Snehal D; Kaushik, Swati; Kuravadi, Nagesh; Lingu, Chandana S; Mahita, J; Malarini, Ramya; Malhotra, Sony; Malini, Manoharan; Mathew, Oommen K; Mutt, Eshita; Naika, Mahantesha; Nitish, Sathyanarayanan; Pasha, Shaik Naseer; Raghavender, Upadhyayula S; Rajamani, Anantharamanan; Shilpa, S; Shingate, Prashant N; Singh, Heikham Russiachand; Sukhwal, Anshul; Sunitha, Margaret S; Sumathi, Manojkumar; Ramaswamy, S; Gowda, Malali; Sowdhamini, Ramanathan.

BMC Plant Biol ; 15: 212, 2015 Aug 28.

Article in English | MEDLINE | ID: mdl-26315624

ABSTRACT

BACKGROUND: Krishna Tulsi, a member of the Lamiaceae family, is a herb well known for its spiritual, religious and medicinal importance in India. The common name of this plant is 'Tulsi' (or 'Tulasi' or 'Thulasi') and it is considered sacred by Hindus. We present the draft genome of Ocimum tenuiflorum L. (subtype Krishna Tulsi) in this report. The paired-end and mate-pair sequence libraries were generated for the whole genome sequenced with the Illumina HiSeq 1000, resulting in an assembled genome of 374 Mb, with a genome coverage of 61% (612 Mb estimated genome size). We have also studied transcriptomes (RNA-Seq) of two subtypes of O. tenuiflorum, Krishna and Rama Tulsi, and report the relative expression of genes in both the varieties. RESULTS: The pathways leading to the production of medicinally important specialized metabolites have been studied in detail, in relation to similar pathways in Arabidopsis thaliana and other plants. Expression levels of anthocyanin biosynthesis-related genes in leaf samples of Krishna Tulsi were observed to be relatively high, explaining the purple colouration of Krishna Tulsi leaves. The expression of six important genes identified from genome data was validated by performing q-RT-PCR in different tissues of five different species, which shows the high extent of ursolic acid-producing genes in young leaves of the Rama subtype. In addition, the presence of eugenol and ursolic acid, implicated as potential drugs in the cure of many diseases including cancer, was confirmed using mass spectrometry. CONCLUSIONS: The availability of the whole genome of O. tenuiflorum and our sequence analysis suggest that small amino acid changes at the functional sites of genes involved in metabolite synthesis pathways confer special medicinal properties to this herb.
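The 61% genome coverage quoted in this abstract is consistent with dividing the assembled size by the estimated genome size. A small sketch of that arithmetic follows; the function name is a hypothetical helper, and only the two sizes (374 Mb and 612 Mb) come from the abstract.

```python
def assembly_coverage_percent(assembled_mb, estimated_genome_mb):
    """Share of the estimated genome represented in the assembly, in percent."""
    if estimated_genome_mb <= 0:
        raise ValueError("estimated genome size must be positive")
    return 100.0 * assembled_mb / estimated_genome_mb


# Sizes as reported in the abstract: 374 Mb assembled, 612 Mb estimated.
coverage = assembly_coverage_percent(374, 612)
print(round(coverage))  # 61
```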

Subject(s)

Gene Expression Regulation, Plant , Genome, Plant , Ocimum/genetics , India , Ocimum/metabolism , Plant Leaves/metabolism , Plants, Medicinal/genetics , Plants, Medicinal/metabolism

19.

DOCKSCORE: a webserver for ranking protein-protein docked poses.

Malhotra, Sony; Mathew, Oommen K; Sowdhamini, Ramanathan.

BMC Bioinformatics ; 16: 127, 2015 Apr 24.

Article in English | MEDLINE | ID: mdl-25902779

ABSTRACT

BACKGROUND: Proteins interact with a variety of other molecules such as nucleic acids, small molecules and other proteins inside the cell. Structure determination of protein-protein complexes is challenging for several reasons, such as the large molecular weights of these macromolecular complexes, their dynamic nature, and difficulty in purification and sample preparation. Computational docking permits an early understanding of the feasibility and mode of protein-protein interactions. However, docking algorithms propose a number of solutions, and it is a challenging task to select the native or near-native pose(s) from this pool. DockScore is an objective scoring scheme that can be used to rank protein-protein docked poses. It considers several interface parameters, namely, surface area, evolutionary conservation, hydrophobicity, short contacts and spatial clustering at the interface for scoring. RESULTS: We have implemented DockScore in the form of a webserver for its use by the scientific community. The DockScore webserver can be employed, subsequent to docking, to perform scoring of the docked solutions, starting from multiple poses as inputs. The results, with scores and ranks for all the poses, can be downloaded as a CSV file, and a graphical view of the interface of the best-ranking poses is available. CONCLUSIONS: The webserver for DockScore is made freely available for the scientific community at: http://caps.ncbs.res.in/dockscore/ .

Subject(s)

Algorithms , Internet , Molecular Docking Simulation , Proteins/chemistry , Software , ras GTPase-Activating Proteins/metabolism , Humans , Protein Binding , Protein Interaction Domains and Motifs , Proteins/metabolism

20.

LenVarDB: database of length-variant protein domains.

Mutt, Eshita; Mathew, Oommen K; Sowdhamini, Ramanathan.

Nucleic Acids Res ; 42(Database issue): D246-50, 2014 Jan.

Article in English | MEDLINE | ID: mdl-24194591

ABSTRACT

Protein domains are functionally and structurally independent modules, which add to the functional variety of proteins. This array of functional diversity has been enabled by evolutionary changes, such as amino acid substitutions or insertions or deletions, occurring in these protein domains. Length variations (indels) can introduce changes at structural, functional and interaction levels. LenVarDB (freely available at http://caps.ncbs.res.in/lenvardb/) traces these length variations, starting from structure-based sequence alignments in our Protein Alignments organized as Structural Superfamilies (PASS2) database, across 731 structural classification of proteins (SCOP)-based protein domain superfamilies connected to 2 730 625 sequence homologues. Alignment of sequence homologues corresponding to a structural domain is available, starting from a structure-based sequence alignment of the superfamily. Orientation of the length-variant (indel) regions in protein domains can be visualized by mapping them on the structure and on the alignment. Knowledge about location of length variations within protein domains and their visual representation will be useful in predicting changes within structurally or functionally relevant sites, which may ultimately regulate protein function. Non-technical summary: Evolutionary changes bring about natural changes to proteins that may be found in many organisms. Such changes could be reflected as amino acid substitutions or insertions-deletions (indels) in protein sequences. LenVarDB is a database that provides an early overview of observed length variations that were set among 731 protein families and after examining >2 million sequences. Indels are followed up to observe if they are close to the active site such that they can affect the activity of proteins. Inclusion of such information can aid the design of bioengineering experiments.

Subject(s)

Databases, Protein , INDEL Mutation , Protein Structure, Tertiary , Genetic Variation , Internet , Models, Molecular , Protein Structure, Tertiary/genetics , Proteins/classification , Sequence Alignment , Sequence Analysis, Protein
