Pesquisa | Portal Regional da BVS

Genome assembly and chemogenomic profiling of National Flower of Singapore Papilionanthe Miss Joaquim 'Agnes' reveals metabolic pathways regulating floral traits.

Lim, Abner Herbert; Low, Zhen Jie; Shingate, Prashant Narendra; Hong, Jing Han; Chong, Shu Chen; Ng, Cedric Chuan Young; Liu, Wei; Vaser, Robert; Sikic, Mile; Sung, Wing-Kin Ken; Nagarajan, Niranjan; Tan, Patrick; Teh, Bin Tean.

Commun Biol ; 5(1): 967, 2022 09 15.

Artigo em Inglês | MEDLINE | ID: mdl-36109650

RESUMO

Singapore's National Flower, Papilionanthe (Ple.) Miss Joaquim 'Agnes' (PMJ) is highly prized as a horticultural flower from the Orchidaceae family. A combination of short-read sequencing, single-molecule long-read sequencing and chromatin contact mapping was used to assemble the PMJ genome, spanning 2.5 Gb and 19 pseudo-chromosomal scaffolds. Genomic resources and chemical profiling provided insights towards identifying, understanding and elucidating various classes of secondary metabolite compounds synthesized by the flower. For example, presence of the anthocyanin pigments detected by chemical profiling coincides with the expression of ANTHOCYANIN SYNTHASE (ANS), an enzyme responsible for the synthesis of the former. Similarly, the presence of vandaterosides (a unique class of glycosylated organic acids with the potential to slow skin aging) discovered using chemical profiling revealed the involvement of glycosyltransferase family enzymes candidates in vandateroside biosynthesis. Interestingly, despite the unnoticeable scent of the flower, genes involved in the biosynthesis of volatile compounds and chemical profiling revealed the combination of oxygenated hydrocarbons, including traces of linalool, beta-ionone and vanillin, forming the scent profile of PMJ. In summary, by combining genomics and biochemistry, the findings expands the known biodiversity repertoire of the Orchidaceae family and insights into the genome and secondary metabolite processes of PMJ.

Assuntos

Antocianinas , Orchidaceae , Cromatina/metabolismo , Flores/genética , Flores/metabolismo , Regulação da Expressão Gênica de Plantas , Glicosiltransferases/genética , Redes e Vias Metabólicas , Orchidaceae/genética , Singapura

Time- and memory-efficient genome assembly with Raven.

Vaser, Robert; Sikic, Mile.

Nat Comput Sci ; 1(5): 332-336, 2021 May.

Artigo em Inglês | MEDLINE | ID: mdl-38217213

RESUMO

Whole genome sequencing technologies are unable to invariably read DNA molecules intact, a shortcoming that assemblers try to resolve by stitching the obtained fragments back together. Here, we present methods for the improvement of de novo genome assembly from erroneous long reads incorporated into a tool called Raven. Raven maintains similar performance for various genomes and has accuracy on par with other assemblers that support third-generation sequencing data. It is one of the fastest options while having the lowest memory consumption on the majority of benchmarked datasets.

Fast and accurate de novo genome assembly from long uncorrected reads.

Vaser, Robert; Sovic, Ivan; Nagarajan, Niranjan; Sikic, Mile.

Genome Res ; 27(5): 737-746, 2017 05.

Artigo em Inglês | MEDLINE | ID: mdl-28100585

RESUMO

The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequences can be generated efficiently with a SIMD-accelerated, partial-order alignment-based, stand-alone consensus module called Racon. Based on tests with PacBio and Oxford Nanopore data sets, we show that Racon coupled with miniasm enables consensus genomes with similar or better quality than state-of-the-art methods while being an order of magnitude faster.

Assuntos

Algoritmos , Mapeamento de Sequências Contíguas/métodos , Genômica/métodos , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Mapeamento de Sequências Contíguas/normas , Genômica/normas , Alinhamento de Sequência/normas , Análise de Sequência de DNA/normas

SWORD-a highly efficient protein database search.

Vaser, Robert; Pavlovic, Dario; Sikic, Mile.

Bioinformatics ; 32(17): i680-i684, 2016 09 01.

Artigo em Inglês | MEDLINE | ID: mdl-27587689

RESUMO

MOTIVATION: Protein database search is one of the fundamental problems in bioinformatics. For decades, it has been explored and solved using different exact and heuristic approaches. However, exponential growth of data in recent years has brought significant challenges in improving already existing algorithms. BLAST has been the most successful tool for protein database search, but is also becoming a bottleneck in many applications. Due to that, many different approaches have been developed to complement or replace it. In this article, we present SWORD, an efficient protein database search implementation that runs 8-16 times faster than BLAST in the sensitive mode and up to 68 times faster in the fast and less accurate mode. It is designed to be used in nearly all database search environments, but is especially suitable for large databases. Its sensitivity exceeds that of BLAST for majority of input datasets and provides guaranteed optimal alignments. AVAILABILITY AND IMPLEMENTATION: Sword is freely available for download from https://github.com/rvaser/sword CONTACT: robert.vaser@fer.hr and mile.sikic@fer.hr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Bases de Dados de Proteínas , Ferramenta de Busca , Alinhamento de Sequência , Algoritmos , Software

SIFT missense predictions for genomes.

Vaser, Robert; Adusumalli, Swarnaseetha; Leng, Sim Ngak; Sikic, Mile; Ng, Pauline C.

Nat Protoc ; 11(1): 1-9, 2016 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-26633127

RESUMO

The SIFT (sorting intolerant from tolerant) algorithm helps bridge the gap between mutations and phenotypic variations by predicting whether an amino acid substitution is deleterious. SIFT has been used in disease, mutation and genetic studies, and a protocol for its use has been previously published with Nature Protocols. This updated protocol describes SIFT 4G (SIFT for genomes), which is a faster version of SIFT that enables practical computations on reference genomes. Users can get predictions for single-nucleotide variants from their organism of interest using the SIFT 4G annotator with SIFT 4G's precomputed databases. The scope of genomic predictions is expanded, with predictions available for more than 200 organisms. Users can also run the SIFT 4G algorithm themselves. SIFT predictions can be retrieved for 6.7 million variants in 4 min once the database has been downloaded. If precomputed predictions are not available, the SIFT 4G algorithm can compute predictions at a rate of 2.6 s per protein sequence. SIFT 4G is available from http://sift-dna.org/sift4g.

Assuntos

Algoritmos , Genômica/métodos , Mutação de Sentido Incorreto/genética , Bases de Dados de Proteínas , Genômica/normas , Humanos , Anotação de Sequência Molecular , Fenótipo , Padrões de Referência

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA