RESUMO
We sequenced and assembled using multiple long-read sequencing technologies the genomes of chimpanzee, bonobo, gorilla, orangutan, gibbon, macaque, owl monkey, and marmoset. We identified 1,338,997 lineage-specific fixed structural variants (SVs) disrupting 1,561 protein-coding genes and 136,932 regulatory elements, including the most complete set of human-specific fixed differences. We estimate that 819.47 Mbp or â¼27% of the genome has been affected by SVs across primate evolution. We identify 1,607 structurally divergent regions wherein recurrent structural variation contributes to creating SV hotspots where genes are recurrently lost (e.g., CARD, C4, and OLAH gene families) and additional lineage-specific genes are generated (e.g., CKAP2, VPS36, ACBD7, and NEK5 paralogs), becoming targets of rapid chromosomal diversification and positive selection (e.g., RGPD gene family). High-fidelity long-read sequencing has made these dynamic regions of the genome accessible for sequence-level analyses within and between primate species.
Assuntos
Genoma , Primatas , Animais , Humanos , Sequência de Bases , Primatas/classificação , Primatas/genética , Evolução Biológica , Análise de Sequência de DNA , Variação Estrutural do GenomaRESUMO
In order to provide a comprehensive resource for human structural variants (SVs), we generated long-read sequence data and analyzed SVs for fifteen human genomes. We sequence resolved 99,604 insertions, deletions, and inversions including 2,238 (1.6 Mbp) that are shared among all discovery genomes with an additional 13,053 (6.9 Mbp) present in the majority, indicating minor alleles or errors in the reference. Genotyping in 440 additional genomes confirms the most common SVs in unique euchromatin are now sequence resolved. We report a ninefold SV bias toward the last 5 Mbp of human chromosomes with nearly 55% of all VNTRs (variable number of tandem repeats) mapping to this portion of the genome. We identify SVs affecting coding and noncoding regulatory loci improving annotation and interpretation of functional variation. These data provide the framework to construct a canonical human reference and a resource for developing advanced representations capable of capturing allelic diversity.
Assuntos
Frequência do Gene/genética , Genoma Humano/genética , Variação Estrutural do Genoma/genética , Alelos , Eucromatina/genética , Genômica/métodos , Humanos , Repetições Minissatélites/genética , Análise de Sequência de DNA/métodosRESUMO
We sequenced the MSY (male-specific region of the Y chromosome) of the C57BL/6J strain of the laboratory mouse Mus musculus. In contrast to theories that Y chromosomes are heterochromatic and gene poor, the mouse MSY is 99.9% euchromatic and contains about 700 protein-coding genes. Only 2% of the MSY derives from the ancestral autosomes that gave rise to the mammalian sex chromosomes. Instead, all but 45 of the MSY's genes belong to three acquired, massively amplified gene families that have no homologs on primate MSYs but do have acquired, amplified homologs on the mouse X chromosome. The complete mouse MSY sequence brings to light dramatic forces in sex chromosome evolution: lineage-specific convergent acquisition and amplification of X-Y gene families, possibly fueled by antagonism between acquired X-Y homologs. The mouse MSY sequence presents opportunities for experimental studies of a sex-specific chromosome in its entirety, in a genetically tractable model organism.
Assuntos
Evolução Biológica , Cromossomos de Mamíferos , Camundongos Endogâmicos C57BL/genética , Análise de Sequência de DNA , Cromossomo Y , Animais , Centrômero , Cromossomos Artificiais Bacterianos/genética , Feminino , Humanos , Masculino , Filogenia , Primatas/genética , Cromossomo XRESUMO
High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1-4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.
Assuntos
Genoma , Genômica/métodos , Vertebrados/genética , Animais , Aves , Biblioteca Gênica , Tamanho do Genoma , Genoma Mitocondrial , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala , Anotação de Sequência Molecular , Alinhamento de Sequência , Análise de Sequência de DNA , Cromossomos Sexuais/genéticaRESUMO
Because of diverged adaptative phenotypes, fish species of the genus Xiphophorus have contributed to a wide range of research for a century. Existing Xiphophorus genome assemblies are not at the chromosomal level and are prone to sequence gaps, thus hindering advancement of the intra- and inter-species differences for evolutionary, comparative, and translational biomedical studies. Herein, we assembled high-quality chromosome-level genome assemblies for three distantly related Xiphophorus species, namely, X. maculatus, X. couchianus, and X. hellerii Our overall goal is to precisely assess microevolutionary processes in the clade to ascertain molecular events that led to the divergence of the Xiphophorus species and to progress understanding of genetic incompatibility to disease. In particular, we measured intra- and inter-species divergence and assessed gene expression dysregulation in reciprocal interspecies hybrids among the three species. We found expanded gene families and positively selected genes associated with live bearing, a special mode of reproduction. We also found positively selected gene families are significantly enriched in nonpolymorphic transposable elements, suggesting the dispersal of these nonpolymorphic transposable elements has accompanied the evolution of the genes, possibly by incorporating new regulatory elements in support of the Britten-Davidson hypothesis. We characterized inter-specific polymorphisms, structural variants, and polymorphic transposable element insertions and assessed their association to interspecies hybridization-induced gene expression dysregulation related to specific disease states in humans.
Assuntos
Ciprinodontiformes , Elementos de DNA Transponíveis , Animais , Humanos , Elementos de DNA Transponíveis/genética , Epistasia Genética , Hibridização Genética , Ciprinodontiformes/genética , Ciprinodontiformes/metabolismoRESUMO
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
RESUMO
Ruminants have a semi-invasive placenta, which possess highly vascularized placentomes formed by maternal endometrial caruncles and fetal placental cotyledons and required for fetal development to term. The synepitheliochorial placenta of cattle contains at least two trophoblast cell populations, including uninucleate (UNC) and binucleate (BNC) cells that are most abundant in the cotyledonary chorion of the placentomes. The interplacentomal placenta is more epitheliochorial in nature with the chorion developing specialized areolae over the openings of uterine glands. Of note, the cell types in the placenta and cellular and molecular mechanisms governing trophoblast differentiation and function are little understood in ruminants. To fill this knowledge gap, the cotyledonary and intercotyledonary areas of the mature day 195 bovine placenta were analyzed by single nuclei analysis. Single-nuclei RNA-seq analysis found substantial differences in cell type composition and transcriptional profiles between the two distinct regions of the placenta. Based on clustering and cell marker gene expression, five different trophoblast cell types were identified in the chorion, including proliferating and differentiating UNC and two different types of BNC in the cotyledon. Cell trajectory analyses provided a framework for understanding the differentiation of trophoblast UNC into BNC. The upstream transcription factor binding analysis of differentially expressed genes identified a candidate set of regulator factors and genes regulating trophoblast differentiation. This foundational information is useful to discover essential biological pathways underpinning the development and function of the bovine placenta.
Assuntos
Placenta , Trofoblastos , Gravidez , Bovinos , Animais , Feminino , Trofoblastos/metabolismo , Placenta/metabolismo , RNA Nuclear Pequeno/metabolismo , Ruminantes , Análise de Sequência de RNARESUMO
The Amazon molly is a unique clonal fish species that originated from an interspecies hybrid between Poecilia species P. mexicana and P. latipinna It reproduces by gynogenesis, which eliminates paternal genomic contribution to offspring. An earlier study showed that Amazon molly shows biallelic expression for a large portion of the genome, leading to two main questions: (1) Are the allelic expression patterns from the initial hybridization event stabilized or changed during establishment of the asexual species and its further evolution? (2) Is allelic expression biased toward one parental allele a stochastic or adaptive process? To answer these questions, the allelic expression of P. formosa siblings was assessed to investigate intra- and inter-cohort allelic expression variability. For comparison, interspecies hybrids between P. mexicana and P. latipinna were produced in the laboratory to represent the P. formosa ancestor. We have identified inter-cohort and intra-cohort variation in parental allelic expression. The existence of inter-cohort divergence suggests functional P. formosa allelic expression patterns do not simply reflect the atavistic situation of the first interspecies hybrid but potentially result from long-term selection of transcriptional fitness. In addition, clonal fish show a transcriptional trend representing minimal intra-clonal variability in allelic expression patterns compared to the corresponding hybrids. The intra-clonal similarity in gene expression translates to sophisticated genetic functional regulation at the individuum level. These findings suggest the parental alleles inherited by P. formosa form tightly regulated genetic networks that lead to a stable transcriptomic landscape within clonal individuals.
Assuntos
Alelos , Poecilia/genética , Transcriptoma , Animais , Feminino , Regulação da Expressão Gênica , Hibridização Genética , MasculinoRESUMO
A central determinant of pregnancy success is proper development of the conceptus (embryo/fetus and associated extraembryonic membranes including the placenta). Although the gross morphology and histology of the bovine placenta have been well studied, the cellular and molecular mechanisms regulating placenta development and trophoblast differentiation and function remain essentially undefined. Here, single-cell transcriptome (scRNA-seq) analysis was performed on the day 17 bovine conceptus and chorion of day 24, 30, and 50 conceptuses (n = 3-4 samples per day) using the 10X Genomics platform. Bioinformatic analyses identified cell types and their ontogeny including trophoblast, mesenchyme, and immune cells. Loss of interferon tau-expressing trophoblast uninucleate cells occurred between days 17 and 30, whereas binucleate cells, identified based on expression of placental lactogen (CSH2) and specific pregnancy-associated glycoprotein genes (PAGs), first appeared on day 24. Several different types of uninucleate cells were present in day 24, 30, and 50 samples, but only one (day 24) or two types of binucleate cells (days 30 and 50). Cell trajectory analyses provided a conceptual framework for uninucleate cell development and binucleate cell differentiation, and bioinformatic analyses identified candidate transcription factors governing differentiation and function of the trophoblasts. The digital atlas of cell types in the developing bovine conceptus reported here serves as a resource to discover key genes and biological pathways regulating its development during the critical periods of implantation and placentation.
Assuntos
Placenta , Trofoblastos , Gravidez , Bovinos , Animais , Feminino , Placenta/metabolismo , Trofoblastos/metabolismo , Placentação , Implantação do Embrião , Diferenciação CelularRESUMO
OBJECTIVE: Gene expression analysis through single-cell RNA sequencing (scRNA-seq) has revolutionized our understanding of gene regulation in diverse cell types, tissues, and organisms. While existing methods primarily focus on identifying cell type-specific gene expression programs (GEPs), the characterization of GEPs associated with biological processes and stimuli responses remains limited. In this study, we aim to infer biologically meaningful GEPs that are associated with both cellular phenotypes and activity programs directly from scRNA-seq data. METHODS: We applied linear CorEx, a machine-learning-based approach, to infer GEPs by grouping genes based on total correlation optimization function in simulated and real-world scRNA-seq datasets. Additionally, we utilized a transfer learning approach to project CorEx-inferred GEPs to other scRNA-seq datasets. RESULTS: By leveraging total correlation optimization, linear CorEx groups genes and demonstrates superior performance in identifying cell types and activity programs compared to similar methods using simulated data. Furthermore, we apply this same approach to real-world scRNA-seq data from the mouse dentate gyrus and embryonic colon development, uncovering biologically relevant GEPs related to cell types, developmental ages, and cell cycle programs. We also demonstrate the potential for transfer learning by evaluating similar datasets, showcasing the cross-species sensitivity of linear CorEx. CONCLUSION: Our findings validate linear CorEx as a valuable tool for comprehensively analyzing complex signals in scRNA-seq data, leading to deeper insights into gene expression dynamics, cellular heterogeneity, and regulatory mechanisms.
Assuntos
Aprendizado de Máquina , RNA-Seq , Análise da Expressão Gênica de Célula Única , Animais , Humanos , Camundongos , Algoritmos , Colo/metabolismo , Colo/citologia , Biologia Computacional/métodos , Giro Denteado/metabolismo , Perfilação da Expressão Gênica/métodos , RNA-Seq/métodosRESUMO
Ecological flexibility, extended lifespans, and large brains have long intrigued evolutionary biologists, and comparative genomics offers an efficient and effective tool for generating new insights into the evolution of such traits. Studies of capuchin monkeys are particularly well situated to shed light on the selective pressures and genetic underpinnings of local adaptation to diverse habitats, longevity, and brain development. Distributed widely across Central and South America, they are inventive and extractive foragers, known for their sensorimotor intelligence. Capuchins have among the largest relative brain size of any monkey and a lifespan that exceeds 50 y, despite their small (3 to 5 kg) body size. We assemble and annotate a de novo reference genome for Cebus imitator Through high-depth sequencing of DNA derived from blood, various tissues, and feces via fluorescence-activated cell sorting (fecalFACS) to isolate monkey epithelial cells, we compared genomes of capuchin populations from tropical dry forests and lowland rainforests and identified population divergence in genes involved in water balance, kidney function, and metabolism. Through a comparative genomics approach spanning a wide diversity of mammals, we identified genes under positive selection associated with longevity and brain development. Additionally, we provide a technological advancement in the use of noninvasive genomics for studies of free-ranging mammals. Our intra- and interspecific comparative study of capuchin genomics provides insights into processes underlying local adaptation to diverse and physiologically challenging environments, as well as the molecular basis of brain evolution and longevity.
Assuntos
Adaptação Fisiológica , Encéfalo/crescimento & desenvolvimento , Cebus/genética , Genoma , Longevidade/genética , Animais , Evolução Molecular , Citometria de Fluxo/métodos , Florestas , Genômica/métodosRESUMO
BACKGROUND: The red junglefowl, the wild outgroup of domestic chickens, has historically served as a reference for genomic studies of domestic chickens. These studies have provided insight into the etiology of traits of commercial importance. However, the use of a single reference genome does not capture diversity present among modern breeds, many of which have accumulated molecular changes due to drift and selection. While reference-based resequencing is well-suited to cataloging simple variants such as single-nucleotide changes and short insertions and deletions, it is mostly inadequate to discover more complex structural variation in the genome. METHODS: We present a pangenome for the domestic chicken consisting of thirty assemblies of chickens from different breeds and research lines. RESULTS: We demonstrate how this pangenome can be used to catalog structural variants present in modern breeds and untangle complex nested variation. We show that alignment of short reads from 100 diverse wild and domestic chickens to this pangenome reduces reference bias by 38%, which affects downstream genotyping results. This approach also allows for the accurate genotyping of a large and complex pair of structural variants at the K feathering locus using short reads, which would not be possible using a linear reference. CONCLUSIONS: We expect that this new paradigm of genomic reference will allow better pinpointing of exact mutations responsible for specific phenotypes, which will in turn be necessary for breeding chickens that meet new sustainability criteria and are resilient to quickly evolving pathogen threats.
Assuntos
Galinhas , Genoma , Animais , Galinhas/genética , Genótipo , Análise de Sequência de DNA , GenômicaRESUMO
Clinicopathological presentations are critical for establishing a postoperative treatment regimen in Colorectal Cancer (CRC), although the prognostic value is low in Stage 2 CRC. We implemented a novel exploratory algorithm based on artificial intelligence (explainable artificial intelligence, XAI) that integrates mutational and clinical features to identify genomic signatures by repurposing the FoundationOne Companion Diagnostic (F1CDx) assay. The training data set (n = 378) consisted of subjects with recurrent and non-recurrent Stage 2 or 3 CRC retrieved from TCGA. Genomic signatures were built for identifying subgroups in Stage 2 and 3 CRC patients according to recurrence using genomic parameters and further associations with the clinical presentation. The summarization of the top-performing genomic signatures resulted in a 32-gene genomic signature that could predict tumor recurrence in CRC Stage 2 patients with high precision. The genomic signature was further validated using an independent dataset (n = 149), resulting in high-precision prognosis (AUC: 0.952; PPV = 0.974; NPV = 0.923). We anticipate that our genomic signatures and NCCN guidelines will improve recurrence predictions in CRC molecular stratification.
Assuntos
Inteligência Artificial , Neoplasias Colorretais , Humanos , Recidiva Local de Neoplasia/patologia , Neoplasias Colorretais/patologia , Mutação , Genômica , Regulação Neoplásica da Expressão GênicaRESUMO
The epigenetic regulation of immune response involves reversible and heritable changes that do not alter the DNA sequence. Though there have been extensive studies accomplished relating to epigenetic changes in cancer cells, recent focus has been shifted on epigenetic-mediated changes in the immune cells including T cells, Macrophages, Natural Killer cells and anti-tumor immune responses. This review compiles the most relevant and recent literature related to the role of epigenetic mechanisms including DNA methylation and histone modifications in immune cells of wide range of cancers. We also include recent research with respect to role of the most relevant transcription factors that epigenetically control the anti-tumor immune response. Finally, a statement of future direction that promises to look forward for strategies to improve immunotherapy in cancer.
Assuntos
Epigênese Genética , Neoplasias , Metilação de DNA , Humanos , Imunoterapia , Neoplasias/genética , Neoplasias/terapiaRESUMO
Studies of Y Chromosome evolution have focused primarily on gene decay, a consequence of suppression of crossing-over with the X Chromosome. Here, we provide evidence that suppression of X-Y crossing-over unleashed a second dynamic: selfish X-Y arms races that reshaped the sex chromosomes in mammals as different as cattle, mice, and men. Using super-resolution sequencing, we explore the Y Chromosome of Bos taurus (bull) and find it to be dominated by massive, lineage-specific amplification of testis-expressed gene families, making it the most gene-dense Y Chromosome sequenced to date. As in mice, an X-linked homolog of a bull Y-amplified gene has become testis-specific and amplified. This evolutionary convergence implies that lineage-specific X-Y coevolution through gene amplification, and the selfish forces underlying this phenomenon, were dominatingly powerful among diverse mammalian lineages. Together with Y gene decay, X-Y arms races molded mammalian sex chromosomes and influenced the course of mammalian evolution.
Assuntos
Análise de Sequência de DNA/veterinária , Cromossomo X/genética , Cromossomo Y/genética , Animais , Bovinos , Linhagem da Célula , Troca Genética , Evolução Molecular , Feminino , Amplificação de Genes , Humanos , Masculino , Camundongos , Especificidade de Órgãos , Testículo/químicaRESUMO
A species' success during the invasion of new areas hinges on an interplay between the demographic processes common to invasions and the specific ecological context of the novel environment. Evolutionary genetic studies of invasive species can investigate how genetic bottlenecks and ecological conditions shape genetic variation in invasions, and our study pairs two invasive populations that are hypothesized to be from the same source population to compare how each population evolved during and after introduction. Invasive European starlings (Sturnus vulgaris) established populations in both Australia and North America in the 19th century. Here, we compare whole-genome sequences among native and independently introduced European starling populations to determine how demographic processes interact with rapid evolution to generate similar genetic patterns in these recent and replicated invasions. Demographic models indicate that both invasive populations experienced genetic bottlenecks as expected based on invasion history, and we find that specific genomic regions have differentiated even on this short evolutionary timescale. Despite genetic bottlenecks, we suggest that genetic drift alone cannot explain differentiation in at least two of these regions. The demographic boom intrinsic to many invasions as well as potential inversions may have led to high population-specific differentiation, although the patterns of genetic variation are also consistent with the hypothesis that this infamous and highly mobile invader adapted to novel selection (e.g., extrinsic factors). We use targeted sampling of replicated invasions to identify and evaluate support for multiple, interacting evolutionary mechanisms that lead to differentiation during the invasion process.
RESUMO
The domestic cat (Felis catus) numbers over 94 million in the USA alone, occupies households as a companion animal, and, like humans, suffers from cancer and common and rare diseases. However, genome-wide sequence variant information is limited for this species. To empower trait analyses, a new cat genome reference assembly was developed from PacBio long sequence reads that significantly improve sequence representation and assembly contiguity. The whole genome sequences of 54 domestic cats were aligned to the reference to identify single nucleotide variants (SNVs) and structural variants (SVs). Across all cats, 16 SNVs predicted to have deleterious impacts and in a singleton state were identified as high priority candidates for causative mutations. One candidate was a stop gain in the tumor suppressor FBXW7. The SNV is found in cats segregating for feline mediastinal lymphoma and is a candidate for inherited cancer susceptibility. SV analysis revealed a complex deletion coupled with a nearby potential duplication event that was shared privately across three unrelated cats with dwarfism and is found within a known dwarfism associated region on cat chromosome B1. This SV interrupted UDP-glucose 6-dehydrogenase (UGDH), a gene involved in the biosynthesis of glycosaminoglycans. Importantly, UGDH has not yet been associated with human dwarfism and should be screened in undiagnosed patients. The new high-quality cat genome reference and the compilation of sequence variation demonstrate the importance of these resources when searching for disease causative alleles in the domestic cat and for identification of feline biomedical models.
Assuntos
Nanismo/genética , Proteína 7 com Repetições F-Box-WD/genética , Genoma/genética , Uridina Difosfato Glucose Desidrogenase/genética , Sequenciamento Completo do Genoma , Alelos , Animais , Gatos , Mapeamento Cromossômico , Predisposição Genética para Doença , Genômica , Humanos , Masculino , Anotação de Sequência Molecular , Filogenia , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
BACKGROUND: Circulating tumor cells (CTCs) are liquid biopsies that represent micrometastatic disease and may offer unique insights into future recurrences in non-small cell lung cancer (NSCLC). Due to CTC rarity and limited stability, no stable CTC-derived xenograft (CDX) models have ever been generated from non-metastatic NSCLC patients directly. Alternative strategies are needed to molecularly characterize CTCs and means of potential future metastases in this potentially curable patient group. METHODS: Surgically resected NSCLC primary tumor tissues from non-metastatic patients were implanted subcutaneously in immunodeficient mice to establish primary tumor patient-derived xenograft (ptPDX) models. CTCs were isolated as liquid biopsies from the blood of ptPDX mice and re-implanted subcutaneously into naïve immunodeficient mice to generate liquid biopsy CTC-derived xenograft (CDX) tumor models. Single cell RNA sequencing was performed and validated in an external dataset of non-xenografted human NSCLC primary tumor and metastases tissues. Drug response testing in CDX models was performed with standard of care chemotherapy (carboplatin/paclitaxel). Blockade of MYC, which has a known role in drug resistance, was performed with a MYC/MAX dimerization inhibitor (10058-F4). RESULTS: Out of ten ptPDX, two (20%) stable liquid biopsy CDX mouse models were generated. Single cell RNA sequencing analysis revealed an additional regenerative alveolar epithelial type II (AT2)-like cell population in CDX tumors that was also identified in non-xenografted NSCLC patients' metastases tissues. Drug testing using these CDX models revealed different treatment responses to carboplatin/paclitaxel. MYC target genes and c-MYC protein were upregulated in the chemoresistant CDX model, while MYC/MAX dimerization blocking could overcome chemoresistance to carboplatin/paclitaxel. CONCLUSIONS: To overcome the lack of liquid biopsy CDX models from non-metastatic NSCLC patients, CDX models can be generated with CTCs from ptPDX models that were originally established from patients' primary tumors. Single cell analyses can identify distinct drug responses and cell heterogeneities in CDX tumors that can be validated in NSCLC metastases tissues. CDX models deserve further development and study to discover personalized strategies against micrometastases in non-metastatic NSCLC patients.
Assuntos
Carcinoma Pulmonar de Células não Pequenas , Neoplasias Pulmonares , Células Neoplásicas Circulantes , Animais , Carboplatina/farmacologia , Carboplatina/uso terapêutico , Carcinogênese , Carcinoma Pulmonar de Células não Pequenas/tratamento farmacológico , Carcinoma Pulmonar de Células não Pequenas/genética , Carcinoma Pulmonar de Células não Pequenas/patologia , Modelos Animais de Doenças , Xenoenxertos , Humanos , Neoplasias Pulmonares/tratamento farmacológico , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/patologia , Camundongos , Células Neoplásicas Circulantes/patologia , Paclitaxel/farmacologia , Paclitaxel/uso terapêuticoRESUMO
Non-small-cell lung cancer (NSCLC) accounts for most cancer-related deaths worldwide. Liquid biopsy by a blood draw to detect circulating tumor cells (CTCs) is a tool for molecular profiling of cancer using single-cell and next-generation sequencing (NGS) technologies. The aim of the study was to identify somatic variants in single CTCs isolated from NSCLC patients by targeted NGS. Thirty-one subjects (20 NSCLC patients, 11 smokers without cancer) were enrolled for blood draws (7.5 mL). CTCs were identified by immunofluorescence, individually retrieved, and DNA-extracted. Targeted NGS was performed to detect somatic variants (single-nucleotide variants (SNVs) and insertions/deletions (Indels)) across 65 oncogenes and tumor suppressor genes. Cancer-associated variants were classified using OncoKB database. NSCLC patients had significantly higher CTC counts than control smokers (p = 0.0132; Mann-Whitney test). Analyzing 23 CTCs and 13 white blood cells across seven patients revealed a total of 644 somatic variants that occurred in all CTCs within the same subject, ranging from 1 to 137 per patient. The highest number of variants detected in ≥1 CTC within a patient was 441. A total of 18/65 (27.7%) genes were highly mutated. Mutations with oncogenic impact were identified in functional domains of seven oncogenes/tumor suppressor genes (NF1, PTCH1, TP53, SMARCB1, SMAD4, KRAS, and ERBB2). Single CTC-targeted NGS detects heterogeneous and shared mutational signatures within and between NSCLC patients. CTC single-cell genomics have potential for integration in NSCLC precision oncology.