ABSTRACT
MOTIVATION: Solanum sitiens is a self-incompatible wild relative of tomato, characterised by salt and drought resistance traits, with the potential to contribute to the improvement of cultivated tomato through breeding programmes. This species differs in morphology, classification and ecotype from other stress-resistant wild tomato relatives such as S. pennellii and S. chilense. The availability of a reference genome for S. sitiens will therefore facilitate the genetic and molecular understanding of salt and drought resistance.

RESULTS: A high-quality de novo genome and transcriptome assembly for S. sitiens (Accession LA1974) has been developed. A hybrid assembly strategy was followed using Illumina short reads (~159X coverage) and PacBio long reads (~44X coverage), generating a total of ~262 Gbp of DNA sequence. A reference genome of 1,245 Mbp, arranged in 1,483 scaffolds with an N50 of 1.826 Mbp, was generated. Genome completeness was estimated at 95% using Benchmarking Universal Single-Copy Orthologs (BUSCO) and the K-mer Analysis Tool (KAT). In addition, ~63 Gbp of RNA-Seq data were generated to support the prediction of 31,164 genes from the assembly and to perform a de novo transcriptome assembly. Lastly, we identified three large inversions compared to S. lycopersicum, containing several drought-resistance-related genes, such as beta-amylase 1 and YUCCA7.

AVAILABILITY: The S. sitiens (LA1974) raw sequencing data, transcriptome and genome assembly have been deposited at the NCBI's Sequence Read Archive under the BioProject number "PRJNA633104". All the commands and scripts necessary to generate the assembly are available at the following GitHub repository: https://github.com/MCorentin/Solanum_sitiens_assembly.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.