Your browser doesn't support javascript.
loading
High-quality chromosome scale genome assemblies of two important Sorghum inbred lines, Tx2783 and RTx436.
Wang, Bo; Chougule, Kapeel; Jiao, Yinping; Olson, Andrew; Kumar, Vivek; Gladman, Nicholas; Huang, Jian; Llaca, Victor; Fengler, Kevin; Wei, Xuehong; Wang, Liya; Wang, Xiaofei; Regulski, Michael; Drenkow, Jorg; Gingeras, Thomas; Hayes, Chad; Armstrong, J Scott; Huang, Yinghua; Xin, Zhanguo; Ware, Doreen.
Afiliação
  • Wang B; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Chougule K; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Jiao Y; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Olson A; Texas Tech University, 1006 Canton Ave, Lubbock, TX 79409-2122, USA.
  • Kumar V; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Gladman N; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Huang J; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Llaca V; USDA ARS Robert W. Holley Center for Agriculture and Health Cornell University, Ithaca, NY, USA.
  • Fengler K; Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK 74078-6028, USA.
  • Wei X; Corteva Agriscience™, 8325 NW 62nd Avenue, Johnston, IA 50131, USA.
  • Wang L; Corteva Agriscience™, 8325 NW 62nd Avenue, Johnston, IA 50131, USA.
  • Wang X; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Regulski M; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Drenkow J; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Gingeras T; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Hayes C; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Armstrong JS; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
  • Huang Y; U.S. Department of Agriculture-Agricultural Research Service, Plant Stress and Germplasm Development Unit, Cropping Systems Research Laboratory, Lubbock, TX 79415, USA.
  • Xin Z; Peanut and Small Grains Research Unit, 1301 N. Western Rd. Stillwater, OK 74075, USA.
  • Ware D; USDA-ARS Plant Science Research Laboratory, 1301 N. Western Road, Stillwater, OK 74075-2714, USA.
NAR Genom Bioinform ; 6(3): lqae097, 2024 Sep.
Article em En | MEDLINE | ID: mdl-39131819
ABSTRACT
Sorghum bicolor (L.) Moench is a significant grass crop globally, known for its genetic diversity. High quality genome sequences are needed to capture the diversity. We constructed high-quality, chromosome-level genome assemblies for two vital sorghum inbred lines, Tx2783 and RTx436. Through advanced single-molecule techniques, long-read sequencing and optical maps, we improved average sequence continuity 19-fold and 11-fold higher compared to existing Btx623 v3.0 reference genome and obtained 19 and 18 scaffolds (N50 of 25.6 and 14.4) for Tx2783 and RTx436, respectively. Our gene annotation efforts resulted in 29 612 protein-coding genes for the Tx2783 genome and 29 265 protein-coding genes for the RTx436 genome. Comparative analyses with 26 plant genomes which included 18 sorghum genomes and 8 outgroup species identified around 31 210 protein-coding gene families, with about 13 956 specific to sorghum. Using representative models from gene trees across the 18 sorghum genomes, a total of 72 579 pan-genes were identified, with 14% core, 60% softcore and 26% shell genes. We identified 99 genes in Tx2783 and 107 genes in RTx436 that showed functional enrichment specifically in binding and metabolic processes, as revealed by the GO enrichment Pearson Chi-Square test. We detected 36 potential large inversions in the comparison between the BTx623 Bionano map and the BTx623 v3.1 reference sequence. Strikingly, these inversions were notably absent when comparing Tx2783 or RTx436 with the BTx623 Bionano map. These inversion were mostly in the pericentromeric region which is known to have low complexity regions and harder to assemble and suggests the presence of potential artifacts in the public BTx623 reference assembly. Furthermore, in comparison to Tx2783, RTx436 exhibited 324 883 additional Single Nucleotide Polymorphisms (SNPs) and 16 506 more Insertions/Deletions (INDELs) when using BTx623 as the reference genome. We also characterized approximately 348 nucleotide-binding leucine-rich repeat (NLR) disease resistance genes in the two genomes. These high-quality genomes serve as valuable resources for discovering agronomic traits and structural variation studies.

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article