RESUMO
Obtaining high-quality DNA suitable for long-read sequencing can be difficult for many types of tissues and cells, and it is a key step in current genomic studies. The challenge is even greater when it comes to isolating genomic DNA from mammalian spermatozoa, as DNA is tightly packed into a cell with a robust membrane rich in disulfide bonds. Here we describe a method for isolating high molecular weight DNA from Bovine commercial semen straws. This protocol includes a cleaning step to remove diluents and preservatives used for the long-term storage of the semen, which may affect long read sequencing. It is based on a simple salting-out method and avoid the use of spin columns, strong mixing or intensive centrifugation, in order to limit DNA fragmentation. However, we have adapted this protocol to facilitate the disruption of cell membranes and disulfide bonds with strong chaotropic and reducing agents. The average size of the fragments produced was approximately 49 kb, ranging from 25 to 85 kb, according to the femto pulse profiles.This method was used to isolate DNA from semen straws, more than 80 of them were successfully sequenced using the Continuous Long-Read (CLR) sequencing mode on the PacBio SequelII platform to study genome diversity and notably to detect large structural variations within genomes.
Assuntos
DNA , Genoma , Sêmen , Análise de Sequência de DNA , Animais , Bovinos , Masculino , DNA/isolamento & purificação , DNA/genética , Análise de Sequência de DNA/métodos , Espermatozoides , Sequenciamento de Nucleotídeos em Larga Escala/métodosRESUMO
Inspired by the production of reference data sets in the Genome in a Bottle project, we sequenced one Charolais heifer with different technologies: Illumina paired-end, Oxford Nanopore, Pacific Biosciences (HiFi and CLR), 10X Genomics linked-reads, and Hi-C. In order to generate haplotypic assemblies, we also sequenced both parents with short reads. From these data, we built two haplotyped trio high quality reference genomes and a consensus assembly, using up-to-date software packages. The assemblies obtained using PacBio HiFi reaches a size of 3.2 Gb, which is significantly larger than the 2.7 Gb ARS-UCD1.2 reference. The BUSCO score of the consensus assembly reaches a completeness of 95.8%, among highly conserved mammal genes. We also identified 35,866 structural variants larger than 50 base pairs. This assembly is a contribution to the bovine pangenome for the "Charolais" breed. These datasets will prove to be useful resources enabling the community to gain additional insight on sequencing technologies for applications such as SNP, indel or structural variant calling, and de novo assembly.
Assuntos
Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Animais , Bovinos , Feminino , Benchmarking , Genoma , Análise de Sequência de DNARESUMO
BACKGROUND: Structural Variations (SVs) are genomic rearrangements derived from duplication, deletion, insertion, inversion, and translocation events. In the past, SVs detection was limited to cytological approaches, then to Next-Generation Sequencing (NGS) short reads and partitioned assemblies. Nowadays, technologies such as DNA long read sequencing and optical mapping have revolutionized the understanding of SVs in genomes, due to the enhancement of the power of SVs detection. This study aims to investigate performance of two techniques, 1) long-read sequencing obtained with the MinION device (Oxford Nanopore Technologies) and 2) optical mapping obtained with Saphyr device (Bionano Genomics) to detect and characterize SVs in the genomes of the two ecotypes of Arabidopsis thaliana, Columbia-0 (Col-0) and Landsberg erecta 1 (Ler-1). RESULTS: We described the SVs detected from the alignment of the best ONT assembly and DLE-1 optical maps of A. thaliana Ler-1 against the public reference genome Col-0 TAIR10.1. After filtering (SV > 1 kb), 1184 and 591 Ler-1 SVs were retained from ONT and Bionano technologies respectively. A total of 948 Ler-1 ONT SVs (80.1%) corresponded to 563 Bionano SVs (95.3%) leading to 563 common locations. The specific locations were scrutinized to assess improvement in SV detection by either technology. The ONT SVs were mostly detected near TE and gene features, and resistance genes seemed particularly impacted. CONCLUSIONS: Structural variations linked to ONT sequencing error were removed and false positives limited, with high quality Bionano SVs being conserved. When compared with the Col-0 TAIR10.1 reference genome, most of the detected SVs discovered by both technologies were found in the same locations. ONT assembly sequence leads to more specific SVs than Bionano one, the latter being more efficient to characterize large SVs. Even if both technologies are complementary approaches, ONT data appears to be more adapted to large scale populations studies, while Bionano performs better in improving assembly and describing specificity of a genome compared to a reference.
Assuntos
Nanoporos , Genoma , Variação Estrutural do Genoma , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodosRESUMO
Among crop fruit trees, the apricot (Prunus armeniaca) provides an excellent model to study divergence and adaptation processes. Here, we obtain nearly 600 Armeniaca apricot genomes and four high-quality assemblies anchored on genetic maps. Chinese and European apricots form two differentiated gene pools with high genetic diversity, resulting from independent domestication events from distinct wild Central Asian populations, and with subsequent gene flow. A relatively low proportion of the genome is affected by selection. Different genomic regions show footprints of selection in European and Chinese cultivated apricots, despite convergent phenotypic traits, with predicted functions in both groups involved in the perennial life cycle, fruit quality and disease resistance. Selection footprints appear more abundant in European apricots, with a hotspot on chromosome 4, while admixture is more pervasive in Chinese cultivated apricots. Our study provides clues to the biology of selected traits and targets for fruit tree research and breeding.
Assuntos
Domesticação , Genoma de Planta/genética , Prunus armeniaca/genética , Cromossomos de Plantas/genética , Resistência à Doença/genética , Evolução Molecular , Frutas/classificação , Frutas/genética , Frutas/crescimento & desenvolvimento , Fluxo Gênico , Variação Genética , Estágios do Ciclo de Vida/genética , Metagenômica , Fenótipo , Filogenia , Prunus armeniaca/classificação , Prunus armeniaca/crescimento & desenvolvimento , Seleção GenéticaRESUMO
Plant genomes are often characterized by a high level of repetitiveness and polyploid nature. Consequently, creating genome assemblies for plant genomes is challenging. The introduction of short-read technologies 10 years ago substantially increased the number of available plant genomes. Generally, these assemblies are incomplete and fragmented, and only a few are at the chromosome scale. Recently, Pacific Biosciences and Oxford Nanopore sequencing technologies were commercialized that can sequence long DNA fragments (kilobases to megabase) and, using efficient algorithms, provide high-quality assemblies in terms of contiguity and completeness of repetitive regions1-4. However, even though genome assemblies based on long reads exhibit high contig N50s (>1 Mb), these methods are still insufficient to decipher genome organization at the chromosome level. Here, we describe a strategy based on long reads (MinION or PromethION sequencers) and optical maps (Saphyr system) that can produce chromosome-level assemblies and demonstrate applicability by generating high-quality genome sequences for two new dicotyledon morphotypes, Brassica rapa Z1 (yellow sarson) and Brassica oleracea HDEM (broccoli), and one new monocotyledon, Musa schizocarpa (banana). All three assemblies show contig N50s of >5 Mb and contain scaffolds that represent entire chromosomes or chromosome arms.
Assuntos
Brassica rapa/genética , Brassica/genética , Mapeamento Cromossômico/métodos , Cromossomos de Plantas/genética , Genoma de Planta/genética , Nanoporos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Óptica e Fotônica/métodos , Sequências Repetitivas de Ácido Nucleico/genéticaRESUMO
Procambial and cambial stem cells provide the initial cells that allow the formation of vascular tissues. WOX4 and WOX14 have been shown to act redundantly to promote procambial cell proliferation and differentiation. Gibberellins (GAs), which have an important role in wood formation, also stimulate cambial cell division. Here we show that the loss of WOX14 function phenocopies some traits of GA-deficient mutants that can be complemented by exogenous GA application, whereas WOX14 overexpression stimulates the expression of GA3ox anabolism genes and represses GA2ox catabolism genes, promoting the accumulation of bioactive GA. More importantly, our data clearly indicate that WOX14 but not WOX4 promotes vascular cell differentiation and lignification in inflorescence stems of Arabidopsis.