A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing.

Hoang, Nam V; Furtado, Agnelo; Mason, Patrick J; Marquardt, Annelie; Kasirajan, Lakshmi; Thirugnanasambandam, Prathima P; Botha, Frederik C; Henry, Robert J

Hoang, Nam V; Furtado, Agnelo; Mason, Patrick J; Marquardt, Annelie; Kasirajan, Lakshmi; Thirugnanasambandam, Prathima P; Botha, Frederik C; Henry, Robert J.

Afiliação

Hoang NV; Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.
Furtado A; College of Agriculture and Forestry, Hue University, Hue, Vietnam.
Mason PJ; Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.
Marquardt A; Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.
Kasirajan L; Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.
Thirugnanasambandam PP; Sugar Research Australia, Indooroopilly, QLD, 4068, Australia.
Botha FC; Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.
Henry RJ; ICAR - Sugarcane Breeding Institute, Coimbatore, Tamil Nadu, India.

BMC Genomics ; 18(1): 395, 2017 05 22.

Article em En | MEDLINE | ID: mdl-28532419

RESUMO

BACKGROUND: Despite the economic importance of sugarcane in sugar and bioenergy production, there is not yet a reference genome available. Most of the sugarcane transcriptomic studies have been based on Saccharum officinarum gene indices (SoGI), expressed sequence tags (ESTs) and de novo assembled transcript contigs from short-reads; hence knowledge of the sugarcane transcriptome is limited in relation to transcript length and number of transcript isoforms. RESULTS: The sugarcane transcriptome was sequenced using PacBio isoform sequencing (Iso-Seq) of a pooled RNA sample derived from leaf, internode and root tissues, of different developmental stages, from 22 varieties, to explore the potential for capturing full-length transcript isoforms. A total of 107,598 unique transcript isoforms were obtained, representing about 71% of the total number of predicted sugarcane genes. The majority of this dataset (92%) matched the plant protein database, while just over 2% was novel transcripts, and over 2% was putative long non-coding RNAs. About 56% and 23% of total sequences were annotated against the gene ontology and KEGG pathway databases, respectively. Comparison with de novo contigs from Illumina RNA-Sequencing (RNA-Seq) of the internode samples from the same experiment and public databases showed that the Iso-Seq method recovered more full-length transcript isoforms, had a higher N50 and average length of largest 1,000 proteins; whereas a greater representation of the gene content and RNA diversity was captured in RNA-Seq. Only 62% of PacBio transcript isoforms matched 67% of de novo contigs, while the non-matched proportions were attributed to the inclusion of leaf/root tissues and the normalization in PacBio, and the representation of more gene content and RNA classes in the de novo assembly, respectively. About 69% of PacBio transcript isoforms and 41% of de novo contigs aligned with the sorghum genome, indicating the high conservation of orthologs in the genic regions of the two genomes. CONCLUSIONS: The transcriptome dataset should contribute to improved sugarcane gene models and sugarcane protein predictions; and will serve as a reference database for analysis of transcript expression in sugarcane.

Assuntos

Perfilação da Expressão Gênica; Genômica; Poliploidia; Isoformas de RNA/genética; Saccharum/genética; Análise de Sequência de RNA; Processamento Alternativo; Etiquetas de Sequências Expressas/metabolismo; Sequenciamento de Nucleotídeos em Larga Escala; Anotação de Sequência Molecular; RNA Mensageiro/genética

Palavras-chave

De novo assembly; Hybrid assembly; Isoform sequencing; Polyploid transcriptome; SUGIT database; Sugarcane; Transcriptome assembly

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Poliploidia / Análise de Sequência de RNA / Perfilação da Expressão Gênica / Genômica / Saccharum / Isoformas de RNA Tipo de estudo: Prognostic_studies Idioma: En Revista: BMC Genomics Assunto da revista: GENETICA Ano de publicação: 2017 Tipo de documento: Article País de afiliação: Austrália País de publicação: Reino Unido

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google