Your browser doesn't support javascript.
loading
Montrer: 20 | 50 | 100
Résultats 1 - 20 de 1.584
Filtrer
1.
G3 (Bethesda) ; 2024 Jul 26.
Article de Anglais | MEDLINE | ID: mdl-39058924

RÉSUMÉ

Potato (Solanum tuberosum) is an essential crop for food security and is ranked as the third most important crop worldwide for human consumption. The Diacol Capiro cultivar holds the dominant position in Colombian cultivation, primarily catering to the food processing industry. This highly heterozygous, autotetraploid cultivar belongs to the Andigenum group and it stands out for its adaptation to a wide variety of environments spanning altitudes from 1,800 to 3,200 meters above sea level. Here, a chromosome-scale assembly, referred to as DC, is presented for this cultivar. The assembly was generated by combining circular consensus sequencing with proximity ligation Hi-C for the scaffolding and represents 2.369 Gb with 48 pseudochromosomes covering 2,091 Gb and an anchor rate of 88.26%. The reference genome metrics, including an N50 of 50.5 Mb, a BUSCO (Benchmarking Universal Single-Copy Orthologue) score of 99.38%, and an Long Terminal Repeat Assembly Index score of 13.53, collectively signal the achieved high assembly quality. A comprehensive annotation yielded a total of 154,114 genes, and the associated BUSCO score of 95.78% for the annotated sequences attests to their completeness. The number of predicted NLR (Nucleotide-Binding and Leucine-Rich-Repeat genes) was 2107 with a large representation of NBARC (for nucleotide binding domain shared by Apaf-1, certain R gene products, and CED-4) containing domains (99.85%). Further comparative analysis of the proposed annotation-based assembly with high-quality known potato genomes, showed a similar genome metrics with differences in total gene numbers related to the ploidy status. The genome assembly and annotation of DC presented in this study represent a valuable asset for comprehending potato genetics. This resource aids in targeted breeding initiatives and contributes to the creation of enhanced, resilient, and more productive potato varieties, particularly beneficial for countries in Latin America.

2.
Data Brief ; 55: 110596, 2024 Aug.
Article de Anglais | MEDLINE | ID: mdl-39022690

RÉSUMÉ

The pale chub, Zacco platypus (Cypriniformes; Xenocyprididae; homotypic synonym: Opsariichthys platypus; Jordan & Evermann, 1902), is widely distributed in the freshwater ecosystems throughout East Asia, including South Korea. In this study, we constructed a de novo genome assembly of Z. platypus to serve as a reference for fundamental and applied research. The assembly was generated using a combination of long-read Pacific Bioscience (PacBio) sequencing, short-read Illumina sequencing, and Hi-C sequencing technologies. The draft genome of Z. platypus consisted of 16,422,113 reads from the HiFi library, 702,143,130 reads from the Illumina TruSeq library, and 250,789,660 reads from the Hi-C library. Assembly with Hifiasm resulted in 336 contigs, with an N50 length of 31.9 Mb. The final assembled genome size was 838.6 Mb. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis indicated that 3,572 (98.1 %) of the expected genes were found in the assembly, with 3,521 (96.7 %) being single-copy and 51 (1.4 %) duplicated after searching against the Actinopterygii database. Of the 319 Hi-C scaffolds, 24 exceeded 10 Mb were thus classified as chromosome-level scaffolds. The assembled genome comprises 41.45 % repeat sequences. Gene annotation was performed using Illumina RNA-Seq and PacBio Iso-Seq data, based on repeat-masked genome sequences. The final annotation resulted in 34,036 protein-coding genes. This chromosomal-level genome assembly is expected to be a valuable resource for future health assessments in aquatic ecosystems, providing insights into the developmental, environmental, and ecological aspects of Z. platypus.

3.
G3 (Bethesda) ; 2024 Jul 20.
Article de Anglais | MEDLINE | ID: mdl-39031588

RÉSUMÉ

Drosophila subobscura is distributed across Europe, the Near East, and the Americas, while its sister species, D. madeirensis, is endemic to the island of Madeira in the Atlantic Ocean. D. subobscura is known for its strict light-dependence in mating and its unique courtship displays, including nuptial gift giving. D. subobscura has also attracted the interest of researchers because of its abundant variations in chromosomal polymorphisms correlated to the latitude and season, which have been used as a tool to track global climate warming. Although D. madeirensis can be an important resource for understanding the evolutionary underpinning of these genetic characteristics of D. subobscura, little work has been done on the biology of this species. Here, we used a HiFi long-read sequencing dataset to produce a de novo genome assembly for D. madeirensis. This assembly comprises a total of 111 contigs spanning 135.5 Mb, and has an N50 of 24.2 Mb and a BUSCO completeness score of 98.6%. Each of the six chromosomes of D. madeirensis consisted of a single contig except for some centromeric regions. Breakpoints of the chromosomal inversions between D. subobscura and D. madeirensis were characterized using this genome assembly, updating some of the previously identified locations.

4.
New Phytol ; 2024 Jul 19.
Article de Anglais | MEDLINE | ID: mdl-39031984

RÉSUMÉ

Cananga odorata is known as a natural perfume tree of the Annonaceae family in Magnoliales. However, its phylogenetic position and the molecular mechanisms involved in the biosynthesis of the floral volatile organic compounds (VOCs) remain unclear. Here, by combining a variety of sequencing platforms, we present a telomere-to-telomere (T2T) genome of C. odorata with 735.83 Mb, which represents the highest integrity and assembly quality of genome in magnoliid plants reported to date. Phylogenetic analysis based on multiple datasets and approaches showed that C. odorata, as a member of magnoliids, is sister to eudicots, after their divergence from monocots. Metabolomic of VOCs in the essential oil and flowers scent showed that sesquiterpenes, especially ß-caryophyllene, were the major compounds. Two CoTPS21 homologues derived from tandem duplication events were highly expressed during flower development and were identified as the key sesquiterpene synthases for the production of ß-caryophyllene. In addition, CoSPL3 and CoSPL9 were considered as potential transcription factors for activating the expression of CoTPS21 homologues. Our results shed light on the molecular mechanisms underlying the biosynthesis of the unique floral fragrance in C. odorata and provide new insights into the phylogenetic position of magnoliids.

6.
G3 (Bethesda) ; 2024 Jul 08.
Article de Anglais | MEDLINE | ID: mdl-38973709

RÉSUMÉ

The giant freshwater prawn (Macrobrachium rosenbergii) is a key species in the aquaculture industry in several Asian, African and South American countries. Despite a considerable growth in its production worldwide, the genetic complexities of M. rosenbergii various morphotypes pose challenges in cultivation. This study reports the first chromosome-scale reference genome and a high-quality full-length transcriptome assembly for M. rosenbergii. We employed the PacBio High Fidelity (HiFi) sequencing to obtain an initial draft assembly and further scaffolded it with the chromatin contact mapping (Hi-C) technique to achieve a final assembly of 3.73-Gb with an N50 scaffold length of 33.6 Mb. Repetitive elements constituted nearly 60% of the genome assembly, with simple sequence repeats and retrotransposons being the most abundant. The availability of both the chromosome-scale assembly and the full-length transcriptome assembly enabled us to thoroughly probe alternative splicing events in M. rosenbergii. Among the 2,041 events investigated, exon skipping represented the most prevalent class, followed by intron retention. Interestingly, specific isoforms were observed across multiple tissues. Additionally, within a single tissue type, transcripts could undergo alternative splicing, yielding multiple isoforms. We believe that the availability of a chromosome-level reference genome for M. rosenbergii along with its full-length transcriptome will be instrumental in advancing our understanding of the giant freshwater prawn biology and enhancing its molecular breeding programs, paving the way for the development of M. rosenbergii with valuable traits in commercial aquaculture.

7.
Int J Mol Sci ; 25(13)2024 Jun 28.
Article de Anglais | MEDLINE | ID: mdl-39000250

RÉSUMÉ

Beef is a major global source of protein, playing an essential role in the human diet. The worldwide production and consumption of beef continue to rise, reflecting a significant trend. However, despite the critical importance of beef cattle resources in agriculture, the diversity of cattle breeds faces severe challenges, with many breeds at risk of extinction. The initiation of the Beef Cattle Genome Project is crucial. By constructing a high-precision functional annotation map of their genome, it becomes possible to analyze the genetic mechanisms underlying important traits in beef cattle, laying a solid foundation for breeding more efficient and productive cattle breeds. This review details advances in genome sequencing and assembly technologies, iterative upgrades of the beef cattle reference genome, and its application in pan-genome research. Additionally, it summarizes relevant studies on the discovery of functional genes associated with key traits in beef cattle, such as growth, meat quality, reproduction, polled traits, disease resistance, and environmental adaptability. Finally, the review explores the potential of telomere-to-telomere (T2T) genome assembly, structural variations (SVs), and multi-omics techniques in future beef cattle genetic breeding. These advancements collectively offer promising avenues for enhancing beef cattle breeding and improving genetic traits.


Sujet(s)
Génome , Animaux , Bovins/génétique , Génomique/méthodes , Sélection/méthodes , Séquençage du génome entier/méthodes , Viande rouge , Locus de caractère quantitatif
8.
DNA Res ; 2024 Jul 01.
Article de Anglais | MEDLINE | ID: mdl-38946223

RÉSUMÉ

Tamarix austromongolica is endemic to the Yellow River Basin and has adapted to diverse ecological settings in the region, including the arid areas of northwestern China and the saline soil regions of the Yellow River Delta. However, the genetic basis of its local adaptation remains unclear. We report a chromosome-level assembly of the T. austromongolica genome based on PacBio high-fidelity sequencing and Hi-C technology. The 12 pseudochromosomes cover 98.44% of the 1.32 Gb assembly, with a contig N50 of 52.57 Mb and a BUSCO score of 98.2%. The genome comprises 913.6 Mb (68.83%) of repetitive sequences and 22,374 protein-coding genes. Genome evolution analyses suggest that genes under positive selection and significantly expanded gene families have facilitated T. austromongolica's adaptability to diverse environmental factors and high resistance to diseases. Using genotyping-by-sequencing, we conducted population structure and selection analyses of 114 samples from 15 sites. Two genetic groups were identified, and 114 and 289 candidate genes were assigned to the populations of the northwestern and eastern parts of the Yellow River, respectively. Furthermore, we discovered numerous candidate genes associated with high-altitude adaptability and salt tolerance. This research provides valuable genomic resources for the evolutionary study and genetic breeding of tamarisk.

9.
Genome Biol Evol ; 16(7)2024 Jul 03.
Article de Anglais | MEDLINE | ID: mdl-38946321

RÉSUMÉ

Oecanthus is a genus of cricket known for its distinctive chirping and distributed across major zoogeographical regions worldwide. This study focuses on Oecanthus rufescens, and conducts a comprehensive examination of its genome through genome sequencing technologies and bioinformatic analysis. A high-quality chromosome-level genome of O. rufescens was successfully obtained, revealing significant features of its genome structure. The genome size is 877.9 Mb, comprising ten pseudo-chromosomes and 70 other sequences, with a GC content of 41.38% and an N50 value of 157,110,771 bp, indicating a high level of continuity. BUSCO assessment results demonstrate that the genome's integrity and quality are high (of which 96.8% are single-copy and 1.6% are duplicated). Comprehensive genome annotation was also performed, identifying approximately 310 Mb of repetitive sequences, accounting for 35.3% of the total genome sequence, and discovering 15,481 tRNA genes, 4,082 rRNA genes, and 1,212 other noncoding genes. Furthermore, 15,031 protein-coding genes were identified, with BUSCO assessment results showing that 98.4% (of which 96.3% are single-copy and 1.6% are duplicated) of the genes were annotated.


Sujet(s)
Génome d'insecte , Annotation de séquence moléculaire , Animaux , Chromosomes d'insecte/génétique , Gryllidae/génétique , Orthoptera/génétique , Orthoptera/classification
10.
Cell ; 187(14): 3541-3562.e51, 2024 Jul 11.
Article de Anglais | MEDLINE | ID: mdl-38996487

RÉSUMÉ

Analyses of ancient DNA typically involve sequencing the surviving short oligonucleotides and aligning to genome assemblies from related, modern species. Here, we report that skin from a female woolly mammoth (†Mammuthus primigenius) that died 52,000 years ago retained its ancient genome architecture. We use PaleoHi-C to map chromatin contacts and assemble its genome, yielding 28 chromosome-length scaffolds. Chromosome territories, compartments, loops, Barr bodies, and inactive X chromosome (Xi) superdomains persist. The active and inactive genome compartments in mammoth skin more closely resemble Asian elephant skin than other elephant tissues. Our analyses uncover new biology. Differences in compartmentalization reveal genes whose transcription was potentially altered in mammoths vs. elephants. Mammoth Xi has a tetradic architecture, not bipartite like human and mouse. We hypothesize that, shortly after this mammoth's death, the sample spontaneously freeze-dried in the Siberian cold, leading to a glass transition that preserved subfossils of ancient chromosomes at nanometer scale.


Sujet(s)
Génome , Mammouths , Peau , Animaux , Mammouths/génétique , Génome/génétique , Femelle , Éléphants/génétique , Chromatine/génétique , Fossiles , ADN ancien/analyse , Souris , Humains , Chromosome X/génétique
11.
G3 (Bethesda) ; 2024 Jul 19.
Article de Anglais | MEDLINE | ID: mdl-39028118

RÉSUMÉ

The Hunt bumble bee, Bombus huntii, is a widely distributed pollinator in western North America. The species produces large colony sizes in captive rearing conditions, experiences low parasite and pathogen loads, and has been demonstrated to be an effective pollinator of tomatoes grown in controlled environment agriculture systems. These desirable traits have galvanized producer efforts to develop commercial B. huntii colonies for growers to deliver pollination services to crops. To better understand B. huntii biology and support population genetic studies and breeding decisions, we sequenced and assembled the B. huntii genome from a single haploid male. High-fidelity sequencing of the entire genome using PacBio, along with HiC sequencing, led to a comprehensive contig assembly of high continuity. This assembly was further organized into a chromosomal arrangement, successfully identifying 18 chromosomes spread across the 317.4 Mb assembly with a BUSCO score indicating 97.6% completeness. Synteny analysis demonstrates shared chromosome number (n = 18) with B. terrestris, a species belonging to a different subgenus, matching the expectation that presence of 18 haploid chromosomes is an ancestral trait at least between the subgenera Pyrobombus and Bombus sensu stricto. In conclusion, the assembly outcome, alongside the minimal tissue sampled destructively, showcase efficient techniques for producing a comprehensive, highly contiguous genome.

12.
Plant J ; 2024 Jul 23.
Article de Anglais | MEDLINE | ID: mdl-39039964

RÉSUMÉ

Vicia sativa ssp. amphicarpa is a unique forage crop capable of simultaneously producing fruits above and below ground, representing a typical amphicarpic plant. In this study, we sequenced and assembled seven pseudo-chromosomes of the genome of V. sativa ssp. amphicarpa (n = 7) yielding a genome size of 1.59 Gb, with a total annotation of 48 932 protein-coding genes. Long terminal repeat (LTR) elements constituted 62.28% of the genome, significantly contributing to the expansion of genome size. Phylogenetic analysis revealed that the divergence between V. sativa ssp. amphicarpa and V. sativa was around 0.88 million years ago (MYA). Comparative transcriptomic and metabolomic analysis of aerial and subterranean pod shells showed biosynthesis of terpenoids in the subterranean pod shells indicating a correlation between the antimicrobial activity of subterranean pod shells and the biosynthesis of terpenoids. Furthermore, functional validation indicates that overexpression of VsTPS5 and VsTPS16 enhances terpenoid biosynthesis for antibacterial activity. Metabolomic analysis suggests the involvement of terpenoids in the antimicrobial properties of subterranean pod shells. Deciphering the genome of V. sativa ssp. amphicarpa elucidated the molecular mechanisms behind the antimicrobial properties of subterranean fruits in amphicarpic plants, providing valuable insights for the study of amphicarpic plant biology.

13.
Mitochondrial DNA B Resour ; 9(7): 851-855, 2024.
Article de Anglais | MEDLINE | ID: mdl-38957225

RÉSUMÉ

Syzygium buxifolium. Hook. Et Arn.1833 is a member of the Myrtaceae family. This species is used in traditional Chinese medicines. It possesses numerous synonyms, reflecting the ambiguity in its taxonomy. The chloroplast genome has been widely used for species identification and phylogenetic analysis. Regrettably, there is a lack of information regarding the chloroplast genome of S. buxifolium. Here, we intend to obtain the chloroplast genome of S. buxifolium to resolve its classification problems. In particular, we utilized Illumina sequencing technology to sequence, GetOrganelle to assemble, and CPGAVAS2 to characterize the chloroplast genome of S. buxifolium. The chloroplast genome of S. buxifolium had a length of 158,581 bp and consisted of 111 unique genes, comprising 78 protein-coding genes, 29 transfer RNA (tRNA) genes, and four ribosomal RNA (rRNA) genes. In addition, we identified 86 Simple Sequence Repeats, 345 tandem repetitive sequences, and 34 dispersed repetitive sequences using modules implemented in CPGAVAS2. Lastly, we carried out phylogenetic analysis using Phylosuite. The results indicated a close relationship between S. buxifolium and S. grijsii. This study offers novel genetic data for the molecular identification and subsequent phylogenetic analysis of the Syzygium genus.

14.
Plant Physiol ; 2024 Jul 11.
Article de Anglais | MEDLINE | ID: mdl-38991561

RÉSUMÉ

Hybrid plants are found extensively in the wild, and they often demonstrate superior performance of complex traits over their parents and other selfing plants. This phenomenon, known as heterosis, has been extensively applied in plant breeding for decades. However, the process of decoding hybrid plant genomes has seriously lagged due to the challenges associated with genome assembly and the lack of appropriate methodologies for their subsequent representation and analysis. Here, we present the assembly and analysis of two hybrids, an intraspecific hybrid between two maize (Zea may ssp. mays) inbred lines and an interspecific hybrid between maize and its wild relative teosinte (Zea may ssp. parviglumis), utilizing a combination of PacBio High Fidelity (HiFi) sequencing and chromatin conformation capture sequencing data. The haplotypic assemblies are well-phased at chromosomal scale, successfully resolving the complex loci with extensive parental structural variations (SVs). By integrating into a bi-parental genome graph, the haplotypic assemblies can facilitate downstream short-reads-based SV calling and allele-specific gene expression analysis, demonstrating outstanding advantages over a single linear genome. Our work offers a comprehensive workflow that aims to facilitate the decoding of numerous hybrid plant genomes, particularly those with unknown or inaccessible parentage, thereby enhancing our understanding of genome evolution and heterosis.

15.
DNA Res ; 2024 Jul 17.
Article de Anglais | MEDLINE | ID: mdl-39017645

RÉSUMÉ

Gentiana straminea Maxim. is a perennial herb and mainly distributed in the Qinghai-Tibetan Plateau. To adapt to the extreme environment, it has developed particular morphological, physiological and genetic structures. Also, rich in iridoids, it is one of the original plants of traditional Chinese herb "Qinjiao". Herein, we present its first chromosome-level genome sequence assembly, and compare it with the genomes of other Gentiana species to facilitate the analysis of genomic characteristics. The assembled genome size of G. straminea was 1.25 Gb, with a contig N50 of 7.5 Mb. A total of 96.08% of the genome sequences was anchored on 13 pseudochromosomes, with a scaffold N50 of 92.70 Mb. A total of 54,310 protein-coding genes were predicted, 80.25% of which were functionally annotated. Comparative genomic analyses indicated that G. straminea experienced two whole-genome duplication events after the γ whole-genome triplication with other eudicots, and it diverged from other Gentiana species at ~3.2 Mya. A total of 142 enzyme-coding genes related to iridoid biosynthesis were identified in its genome. Additionally, we identified differences in the number and expression patterns of iridoid biosynthetic pathway genes in G. straminea compared with two other Gentiana species by integrating whole-genome sequence and transcriptomic analyses.

16.
F1000Res ; 13: 556, 2024.
Article de Anglais | MEDLINE | ID: mdl-38984017

RÉSUMÉ

Background: Determining the appropriate computational requirements and software performance is essential for efficient genomic surveillance. The lack of standardized benchmarking complicates software selection, especially with limited resources. Methods: We developed a containerized benchmarking pipeline to evaluate seven long-read assemblers-Canu, GoldRush, MetaFlye, Strainline, HaploDMF, iGDA, and RVHaplo-for viral haplotype reconstruction, using both simulated and experimental Oxford Nanopore sequencing data of HIV-1 and other viruses. Benchmarking was conducted on three computational systems to assess each assembler's performance, utilizing QUAST and BLASTN for quality assessment. Results: Our findings show that assembler choice significantly impacts assembly time, with CPU and memory usage having minimal effect. Assembler selection also influences the size of the contigs, with a minimum read length of 2,000 nucleotides required for quality assembly. A 4,000-nucleotide read length improves quality further. Canu was efficient among de novo assemblers but not suitable for multi-strain mixtures, while GoldRush produced only consensus assemblies. Strainline and MetaFlye were suitable for metagenomic sequencing data, with Strainline requiring high memory and MetaFlye operable on low-specification machines. Among reference-based assemblers, iGDA had high error rates, RVHaplo showed the best runtime and accuracy but became ineffective with similar sequences, and HaploDMF, utilizing machine learning, had fewer errors with a slightly longer runtime. Conclusions: The HIV-64148 pipeline, containerized using Docker, facilitates easy deployment and offers flexibility to select from a range of assemblers to match computational systems or study requirements. This tool aids in genome assembly and provides valuable information on HIV-1 sequences, enhancing viral evolution monitoring and understanding.


Sujet(s)
Biologie informatique , Génomique , VIH-1 (Virus de l'Immunodéficience Humaine de type 1) , Logiciel , VIH-1 (Virus de l'Immunodéficience Humaine de type 1)/génétique , Biologie informatique/méthodes , Génomique/méthodes , Humains , Génome viral/génétique
17.
BMC Genom Data ; 25(1): 70, 2024 Jul 15.
Article de Anglais | MEDLINE | ID: mdl-39009995

RÉSUMÉ

OBJECTIVES: Ants are ecologically dominant insects in most terrestrial ecosystems, with more than 14,000 extant species in about 340 genera recorded to date. However, genomic resources are still scarce for most species, especially for species endemic in East or Southeast Asia, limiting the study of phylogeny, speciation and adaptation of this evolutionarily successful animal lineage. Here, we assemble and annotate the genomes of Odontoponera transversa and Camponotus friedae, two ant species with a natural distribution in China, to facilitate future study of ant evolution. DATA DESCRIPTION: We obtained a total of 16 Gb and 51 Gb PacBio HiFi data for O. transversa and C. friedae, respectively, which were assembled into the draft genomes of 339 Mb for O. transversa and 233 Mb for C. friedae. Genome assessments by multiple metrics showed good completeness and high accuracy of the two assemblies. Gene annotations assisted by RNA-seq data yielded a comparable number of protein-coding genes in the two genomes (10,892 for O. transversa and 11,296 for C. friedae), while repeat annotations revealed a remarkable difference of repeat content between these two ant species (149.4 Mb for O. transversa versus 49.7 Mb for C. friedae). Besides, complete mitochondrial genomes for the two species were assembled and annotated.


Sujet(s)
Fourmis , Génome d'insecte , Animaux , Fourmis/génétique , Fourmis/classification , Génome d'insecte/génétique , Annotation de séquence moléculaire , Phylogenèse , Génomique/méthodes
18.
Int J Mol Sci ; 25(14)2024 Jul 17.
Article de Anglais | MEDLINE | ID: mdl-39063065

RÉSUMÉ

Bovicola ovis, commonly known as the sheep-biting louse, is an ectoparasite that adversely affects the sheep industry. Sheep louse infestation lowers the quality of products, including wool and leather, causing a loss of approximately AUD 123M per annum in Australia alone. The lack of a high-quality genome assembly for the sheep-biting louse, as well as any closely related livestock lice, has hindered the development of louse research and management control tools. In this study, we present the assembly of B. ovis with a genome size of ~123 Mbp based on a nanopore long-read sequencing library and Illumina RNA sequencing, complemented with a chromosome-level scaffolding using the Pore-C multiway chromatin contact dataset. Combining multiple alignment and gene prediction tools, a comprehensive annotation on the assembled B. ovis genome was conducted and recalled 11,810 genes as well as other genomic features including orf, ssr, rRNA and tRNA. A manual curation using alignment with the available closely related louse species, Pediculus humanus, increased the number of annotated genes to 16,024. Overall, this study reported critical genetic resources and biological insights for the advancement of sheep louse research and the development of sustainable control strategies in the sheep industry.


Sujet(s)
Séquençage par nanopores , Animaux , Séquençage par nanopores/méthodes , Ovis/parasitologie , Annotation de séquence moléculaire , Chromosomes/génétique , Maladies des ovins/parasitologie , Génome
19.
F1000Res ; 13: 583, 2024.
Article de Anglais | MEDLINE | ID: mdl-39050029

RÉSUMÉ

Improvements in DNA sequencing technology are allowing the dramatic increase of whole genome data for a wide variety of species. Such genome sequence data can assist the monitoring of intraspecific genetic diversity, but is often lacking for threatened species. In this project, we focused on the national Red List, a catalog of extinct and threatened species, issued by the Japanese government. We combined the data included in it with the record of genome assembly in NCBI and tabulated the assembly availability of the species in the list. The combined data shows a low percentage (2.1%) of the availability of whole genome sequence data for the taxa ranked on the Japanese Red List as well as a strong bias towards mammals and birds in Animalia and vascular plants in Plantae. Our data presentation highlights potential systematic limitations in genome sequencing (e.g., budget for sequencing large genomes of amphibians) and instructs future policies including which taxon needs more effort for genome sequencing. The resultant tables are available in the original website https://treethinkers.nig.ac.jp/redlist/ and are regularly updated.


Sujet(s)
Biodiversité , Espèce en voie de disparition , Génomique , Animaux , Génome , Génomique/méthodes , Japon
20.
Proc Natl Acad Sci U S A ; 121(31): e2322834121, 2024 Jul 30.
Article de Anglais | MEDLINE | ID: mdl-39042694

RÉSUMÉ

We developed a generally applicable method, CRISPR/Cas9-targeted long-read sequencing (CTLR-Seq), to resolve, haplotype-specifically, the large and complex regions in the human genome that had been previously impenetrable to sequencing analysis, such as large segmental duplications (SegDups) and their associated genome rearrangements. CTLR-Seq combines in vitro Cas9-mediated cutting of the genome and pulse-field gel electrophoresis to isolate intact large (i.e., up to 2,000 kb) genomic regions that encompass previously unresolvable genomic sequences. These targets are then sequenced (amplification-free) at high on-target coverage using long-read sequencing, allowing for their complete sequence assembly. We applied CTLR-Seq to the SegDup-mediated rearrangements that constitute the boundaries of, and give rise to, the 22q11.2 Deletion Syndrome (22q11DS), the most common human microdeletion disorder. We then performed de novo assembly to resolve, at base-pair resolution, the full sequence rearrangements and exact chromosomal breakpoints of 22q11.2DS (including all common subtypes). Across multiple patients, we found a high degree of variability for both the rearranged SegDup sequences and the exact chromosomal breakpoint locations, which coincide with various transposons within the 22q11.2 SegDups, suggesting that 22q11DS can be driven by transposon-mediated genome recombination. Guided by CTLR-Seq results from two 22q11DS patients, we performed three-dimensional chromosomal folding analysis for the 22q11.2 SegDups from patient-derived neurons and astrocytes and found chromosome interactions anchored within the SegDups to be both cell type-specific and patient-specific. Lastly, we demonstrated that CTLR-Seq enables cell-type specific analysis of DNA methylation patterns within the deletion haplotype of 22q11DS.


Sujet(s)
Syndrome de DiGeorge , Humains , Syndrome de DiGeorge/génétique , Systèmes CRISPR-Cas , Points de cassure de chromosome , Chromosomes humains de la paire 22/génétique , Génome humain , Réarrangement des gènes , Analyse de séquence d'ADN/méthodes , Délétion de segment de chromosome
SÉLECTION CITATIONS
DÉTAIL DE RECHERCHE