Genome assembly of Erythrophleum fordii, a special "ironwood" tree in China.
Wen, Chang-Yu; Lian, Ju-Yu; Peng, Wei-Xiong; Wang, Zheng-Feng; Yang, Zhi-Gang; Cao, Hong-Lin.
Affiliation
  • Wen CY; Guangdong Forestry Survey and Planning Institute, Guangzhou, 510520, China.
  • Lian JY; Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
  • Peng WX; Key Laboratory of Vegetation Restoration and Management of Degraded Ecosystems, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
  • Wang ZF; South China National Botanical Garden, Guangzhou, 510650, China.
  • Yang ZG; Guangdong Forestry Survey and Planning Institute, Guangzhou, 510520, China.
  • Cao HL; Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China. wzf@scib.ac.cn.
BMC Genom Data; 24(1): 73, 2023-11-28.
Article in En | MEDLINE | ID: mdl-38017381
ABSTRACT

OBJECTIVES:

Erythrophleum is a genus in the Fabaceae family. The genus contains only about 10 species and is known worldwide for its hardwood and medicinal properties. Erythrophleum fordii Oliv. is the only species of this genus distributed in China. Its superior wood and its use in folk medicine have led to overexploitation in the wild. To support its effective conservation and to help elucidate the distinctive genetic traits underlying wood formation and medicinal components, we present its first genome assembly.

DATA DESCRIPTION:

This work generated ~160.8 Gb of raw Nanopore whole genome sequencing (WGS) long reads, ~126.0 Gb of raw MGI WGS short reads and ~29.0 Gb of raw RNA-seq reads from E. fordii leaf tissues. The de novo assembly of the E. fordii genome contained 864,825,911 bp in 59 contigs, with a contig N50 of 30,830,834 bp. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis indicated 98.7% completeness of the assembly. The assembly contained 471,006,885 bp (54.4%) of repetitive sequences and 28,761 genes that coded for 33,803 proteins. The protein sequences were functionally annotated against multiple databases, facilitating comparative genomic analysis.
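
For readers unfamiliar with the contig N50 statistic quoted above, the following is a minimal sketch of how it is conventionally computed: contigs are sorted from longest to shortest, and N50 is the length of the contig at which the running total first reaches half of the assembly size. The function name `n50` and the toy contig lengths are illustrative assumptions; they are not taken from the E. fordii assembly, and this is not the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (in bp)."""
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    # Walk contigs from longest to shortest, accumulating length.
    for length in sorted(lengths, reverse=True):
        running += length
        # The first contig that brings the running total to at least
        # half of the assembly size defines N50.
        if 2 * running >= total:
            return length


# Hypothetical toy lengths, NOT the real E. fordii contigs.
toy_contigs = [40, 30, 20, 10]
print(n50(toy_contigs))
```

A larger N50 relative to the total assembly size generally indicates a more contiguous assembly, which is why the statistic is reported alongside the contig count.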

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Trees / Fabaceae Country/Region as subject: Asia Language: En Journal: BMC Genom Data Year: 2023 Document type: Article Affiliation country: