ABSTRACT
BACKGROUND: Large-scale initiatives such as The Cancer Genome Atlas (TCGA) have performed genomic studies on kidney cancer in predominantly Caucasian patients. In this study, we aimed to investigate the genomics of clear cell renal cell carcinoma (ccRCC) in Chinese patients. METHODS: We performed whole-transcriptome sequencing on 55 tumor tissues and 11 matched normal tissues from Chinese ccRCC patients. We systematically analyzed the data from our cohort and compared them comprehensively with the TCGA ccRCC cohort. RESULTS: We found that PBRM1 is mutated at a frequency of 11% in our cohort, much lower than in TCGA Caucasians (33%). In addition, we identified 31 gene fusions, including 5 recurrent ones, associated with apoptosis, tumor suppression and metastasis. We classified our cohort into three classes by gene expression. Class 1 shows significantly elevated gene expression in the VEGF pathway, while Class 3 shows comparatively suppressed expression of this pathway. Class 2 is characterized by increased expression of extracellular matrix organization genes and is associated with high-grade tumors. Applying the classification to TCGA ccRCC patients distinguished tumor prognosis better than previously reported classifications. Class 2 shows the worst survival, and Class 3 is a rare ccRCC subtype in the TCGA cohort. Furthermore, computational analysis of the immune microenvironment of ccRCC identified immune-active and immune-tolerant tumors with significantly increased macrophages and depleted CD4-positive T cells, suggesting that some patients may benefit from immunotherapies. CONCLUSION: In summary, the results presented in this study shed light on distinct genomic expression profiles in the Chinese population, refine the stratification patterns with a new molecular classification, and provide practical guidance for the clinical treatment of ccRCC patients.
ABSTRACT
Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes.
Subjects
Genome/genetics , Genomics , Ursidae/genetics , Algorithms , Animals , China , Conserved Sequence/genetics , Contig Mapping , Diet/veterinary , Dogs , Evolution, Molecular , Female , Fertility/genetics , Fertility/physiology , Heterozygote , Humans , Multigene Family/genetics , Polymorphism, Single Nucleotide/genetics , Receptors, G-Protein-Coupled/genetics , Sequence Alignment , Sequence Analysis, DNA , Synteny/genetics , Ursidae/classification , Ursidae/physiology
ABSTRACT
Papaya, a fruit crop cultivated in tropical and subtropical regions, is known for its nutritional benefits and medicinal applications. Here we report a 3x draft genome sequence of 'SunUp' papaya, the first commercial virus-resistant transgenic fruit tree to be sequenced. The papaya genome is three times the size of the Arabidopsis genome, but contains fewer genes, including significantly fewer disease-resistance gene analogues. Comparison of the five sequenced genomes suggests a minimal angiosperm gene set of 13,311. A lack of recent genome duplication, atypical of other angiosperm genomes sequenced so far, may account for the smaller papaya gene number in most functional groups. Nonetheless, striking amplifications in gene number within particular functional groups suggest roles in the evolution of tree-like habit, deposition and remobilization of starch reserves, attraction of seed dispersal agents, and adaptation to tropical daylengths. Transgenesis at three locations is closely associated with chloroplast insertions into the nuclear genome, and with topoisomerase I recognition sites. Papaya offers numerous advantages as a system for fruit-tree functional genomics, and this draft genome sequence provides the foundation for revealing the basis of Carica's distinguishing morpho-physiological, medicinal and nutritional properties.
Subjects
Carica/genetics , Genome, Plant/genetics , Arabidopsis/genetics , Contig Mapping , Databases, Genetic , Genes, Plant/genetics , Molecular Sequence Data , Plants, Genetically Modified/genetics , Sequence Alignment , Sequence Analysis, DNA , Transcription Factors/genetics , Tropical Climate
ABSTRACT
Next-generation massively parallel DNA sequencing technologies provide ultrahigh throughput at a substantially lower unit data cost; however, the data are very short read length sequences, making de novo assembly extremely challenging. Here, we describe a novel method for de novo assembly of large genomes from short read sequences. We successfully assembled both the Asian and African human genome sequences, achieving an N50 contig size of 7.4 and 5.9 kilobases (kb) and scaffold of 446.3 and 61.9 kb, respectively. The development of this de novo short read assembly method creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost-effective way.
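The N50 figures quoted above summarize assembly contiguity. As a minimal sketch, assuming the common definition (the length L such that contigs of length L or longer together hold at least half of all assembled bases), N50 could be computed as shown below. The function name and the toy lengths are illustrative and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Assumed definition: walk the lengths from longest to shortest and
    return the length at which the running total first reaches half of
    the summed length of all sequences.
    """
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy example (hypothetical lengths in kb, not data from the abstract).
toy_contigs = [2, 3, 4, 5, 6, 7, 8, 9, 10]
toy_n50 = n50(toy_contigs)
```

Because N50 is dominated by the longest sequences, contig N50 and scaffold N50 can differ widely for the same assembly, as they do in the values reported above.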
Subjects
Genome, Human , Human Genome Project , Sequence Alignment/methods , Sequence Analysis, DNA/methods , Asian People/genetics , Black People/genetics , Humans , Oligonucleotide Array Sequence Analysis/economics , Oligonucleotide Array Sequence Analysis/methods , Sequence Alignment/economics , Sequence Analysis, DNA/economics
ABSTRACT
Tumor angiogenesis is a cancer hallmark, and its therapeutic inhibition has provided meaningful, albeit limited, clinical benefit. While angiogenesis inhibitors deprive the tumor of oxygen and essential nutrients, cancer cells activate metabolic adaptations that diminish the therapeutic response. Despite these adaptations, angiogenesis inhibition incurs extensive metabolic stress, prompting us to consider such metabolic stress as an induced vulnerability to therapies targeting cancer metabolism. Metabolomic profiling of angiogenesis-inhibited intracranial xenografts showed a universal decrease in tricarboxylic acid cycle intermediates, corroborating a state of anaplerotic nutrient deficit or stress. Accordingly, we show strong synergy between angiogenesis inhibitors (Avastin, Tivozanib) and inhibitors of glycolysis or oxidative phosphorylation through exacerbation of anaplerotic nutrient stress in intracranial orthotopic xenografted gliomas. Our findings were recapitulated in GBM xenografts that do not have genetically predisposed metabolic vulnerabilities at baseline. Thus, our findings cement the central importance of the tricarboxylic acid cycle as the nexus of metabolic vulnerabilities and suggest a clinical path hypothesis that combines angiogenesis inhibitors with pharmacological interventions targeting tumor metabolism in GBM.
ABSTRACT
Tumor biology is determined not only by immortal cancer cells but also by the tumor microenvironment, which consists of noncancerous cells and extracellular matrix; together they dictate pathogenesis and response to treatment. Tumor purity is the proportion of cancer cells in a tumor. It is a fundamental property of cancer and is associated with many clinical features and outcomes. Here we report the first systematic study of tumor purity in patient-derived xenograft (PDX) and syngeneic tumor models using next-generation sequencing data from >9,000 tumors. We found that tumor purity in PDX models is cancer specific and mimics patient tumors, with variation in stromal content and immune infiltration influenced by the immune systems of host mice. After the initial engraftment, human stroma in a PDX tumor is quickly replaced by mouse stroma, and tumor purity then stays stable in subsequent transplantations and increases only slightly with passage. Similarly, in syngeneic mouse cancer cell line models, tumor purity also turns out to be an intrinsic property with model and cancer specificities. Computational and pathology analyses confirmed the impact of the diverse stromal and immune profiles on tumor purity. Our study deepens the understanding of mouse tumor models, which will enable better and novel uses of these models in developing cancer therapeutics, especially ones targeting the tumor microenvironment. Significance: PDX models are an ideal experimental system for studying tumor purity because of their distinct separation of human tumor cells from mouse stromal and immune cells. This study provides a comprehensive view of tumor purity in 27 cancers in PDX models. It also investigates tumor purity in 19 syngeneic models based on unambiguously identified somatic mutations. It will facilitate tumor microenvironment research and drug development in mouse tumor models.
Subjects
Neoplasms , Humans , Animals , Mice , Xenograft Model Antitumor Assays , Neoplasms/genetics , Immune System , Tumor Microenvironment
ABSTRACT
Studies have revealed key genomic aberrations in pediatric acute myeloid leukemia (AML) based on Western populations. It is unknown to what extent the current genomic findings represent populations with different ethnic backgrounds. Here we present the genomic landscape of driver alterations in Chinese pediatric AML and discover previously undescribed genomic aberrations, including the XPO1-TNRC18 fusion. A comprehensive comparison of the Chinese and Western AML cohorts reveals a substantially distinct genomic alteration profile. For example, Chinese AML patients more commonly exhibit mutations in KIT and CSF3R, and less frequently exhibit mutations in genes of the RAS signaling pathway. These differences in mutation frequencies lead to the detection of previously uncharacterized co-occurring mutation pairs. Importantly, the distinct driver profile is clinically relevant. We propose a refined prognostic risk classification model that better reflects the adverse event risk for Chinese AML patients. These results emphasize the importance of genetic background in precision medicine.
Subjects
Leukemia, Myeloid, Acute , Child , China , Genomics , Humans , Leukemia, Myeloid, Acute/diagnosis , Leukemia, Myeloid, Acute/genetics , Mutation , Mutation Rate
ABSTRACT
Patient-derived tumor xenografts (PDXs) are considered the most predictive preclinical models for conventional cancer drug evaluation and are largely believed to be driven by cancer stem cells (CSCs). A large library of PDXs reflects the diversity of patient populations and thus enables population-based preclinical trials ("Phase II-like mouse clinical trials"); however, PDXs have the practical limitations of low throughput, high cost and long duration. Tumor organoids, which are also patient-derived, CSC-driven models, can be considered the in vitro equivalent of PDXs and overcome certain PDX limitations when dealing with large libraries of organoids or compounds. This study describes a method to create PDX-derived organoids (PDXOs), resulting in paired models for in vitro and in vivo pharmacology research. Subcutaneously transplanted PDX-CR2110 tumors were collected from tumor-bearing mice when the tumors reached 200-800 mm³, per an approved autopsy procedure, followed by removal of the adjacent non-tumor tissues and dissociation into small tumor fragments. The small tumor fragments were washed and passed through a 100 µm cell strainer to remove debris. Cell clusters were collected, suspended in basement membrane extract (BME) solution and plated in a 6-well plate as a solid droplet with surrounding liquid medium for growth in a CO2 incubator. Organoid growth was monitored twice weekly under light microscopy and recorded by photography, and the liquid medium was changed 2 or 3 times a week. The grown organoids were further passaged (7 days later) at a 1:2 ratio by disrupting the BME-embedded organoids using mechanical shearing, aided by the addition of trypsin and 10 µM Y-27632. Organoids were cryopreserved in cryo-tubes for long-term storage after release from BME by centrifugation, and were also sampled (e.g., DNA, RNA and FFPE block) for further characterization.
Subjects
Antineoplastic Agents , Neoplasms , Organoids , Xenograft Model Antitumor Assays , Animals , Disease Models, Animal , Humans , Mice , Pharmacology
ABSTRACT
Misidentification and contamination of biobank samples (e.g. cell lines) have plagued biomedical research. Short tandem repeat (STR) and single-nucleotide polymorphism assays are widely used to authenticate biosamples and detect contamination, but their sensitivity is insufficient, at 5-10% and 3-5%, respectively. Here, we describe a deep NGS-based method with significantly higher sensitivity (≤1%). It can be used to authenticate human and mouse cell lines, xenografts and organoids. It can also reliably identify and quantify contamination of human cell line samples by only a small amount of other cell samples; detect and quantify species-specific components in human-mouse mixed samples (e.g. xenografts) with 0.1% sensitivity; detect mycoplasma contamination; and infer the population structure and gender of human samples. By adopting DNA barcoding technology, we are able to profile 100-200 samples in a single run at a per-sample cost comparable to conventional STR assays, providing a truly high-throughput and low-cost assay for building and maintaining high-quality biobanks.
ABSTRACT
Treatment of infiltrative glioma presents a number of unique challenges due to poor penetration of typical chemotherapeutic agents into the infiltrating edge of tumors. The current chemotherapy options include nitrosoureas (e.g., lomustine) and the imidazotetrazine-class monofunctional DNA alkylating agent temozolomide (TMZ). Both classes of drugs alkylate DNA and have relatively unrestricted passage from blood into the brain, where infiltrative tumor cells reside. Recent research indicates that secondary mutations detected in the RB and AKT-mTOR signaling pathways are linked to characteristics of recurrent tumors specific to TMZ-treated patients. It has been hypothesized that a decrease in the rate of secondary mutations may delay tumor recurrence. To that end, this study was designed to test the viability of decreasing secondary mutations by disrupting the cell division cycle using eflornithine (DFMO), a specific inhibitor of ornithine decarboxylase. The U87MG glioblastoma cell line, characterized by chromosomal abnormalities commonly attributed to primary cancers, was used as a model for this study. The cells were subjected to TMZ treatment for 3 days followed by DFMO treatment for 4 or 11 days. TMZ significantly increased the frequency of mutations in U87MG glioblastoma cells, while DFMO-treated cells showed a mutation frequency statistically similar to that of the untreated cells on the respective treatment days. The findings of this study provide evidence to support the hypothesis that DFMO may inhibit the progression of DNA mutations caused by alkylating chemotherapy agents such as TMZ.
ABSTRACT
A high-density genetic map of papaya (Carica papaya L.) was constructed using microsatellite markers derived from BAC end sequences and whole-genome shotgun sequences. Fifty-four F2 plants derived from varieties AU9 and SunUp were used for linkage mapping. A total of 707 markers, including 706 microsatellite loci and the morphological marker fruit flesh color, were mapped into nine major and three minor linkage groups. The resulting map spanned 1069.9 cM with an average distance of 1.5 cM between adjacent markers. This sequence-based microsatellite map resolved the very large linkage group 2 (LG 2) of the previous high-density map, which was built using amplified fragment length polymorphism markers. The nine major LGs of our map represent papaya's nine haploid chromosomes, with LG 1, the sex chromosome, being the largest. This map validates the suppression of recombination at the male-specific region of the Y chromosome (MSY) mapped on LG 1 and at potential centromeric regions of other LGs. Segregation distortion was detected in a large region on LG 1 surrounding the MSY region due to the abortion of the YY genotype, and in a region of LG 6 due to an unknown cause. This high-density sequence-tagged genetic map is being used to integrate genetic and physical maps and to assign genome sequence scaffolds to papaya chromosomes. It provides a framework for comparative structural and evolutionary genomic research in the order Brassicales.
Subjects
Brassicaceae/genetics , Carica/genetics , Chromosome Mapping/methods , Biological Evolution , Chromosomes, Plant , Genes, Plant , Genetic Linkage , Genome, Plant , Microsatellite Repeats
ABSTRACT
BACKGROUND: Epidermal growth factor receptor (EGFR) is reportedly overexpressed in most esophageal tumors, but most targeted therapies showed no efficacy in non-selected patients. This study aims at investigating the adaptive cetuximab subset in a cohort of esophageal squamous cell carcinoma (ESCC) patient-derived xenografts (PDXs). METHODS: A large panel of ESCC PDXs has been established. The copy number, mRNA expression and immunohistochemistry (IHC) of key EGFR pathways have been examined along with cetuximab response. A preclinical trial on a randomly selected cohort of 16 ESCC PDXs was conducted, and the genomic annotations of these models were compared against the efficacy readout of the mouse trial. RESULTS: The trial identified that 7 of 16 (43.8%) responded to cetuximab (ΔT/ΔC <0 as responders). The gene amplification and expression analysis indicated that EGFR copy number ≥5 (P=0.035), high EGFR mRNA expression (P=0.001) and IHC score of 2-3 (P=0.034) are associated with tumor growth inhibition by cetuximab, suggesting EGFR may function as a single predictive biomarker for cetuximab response in ESCC. CONCLUSIONS: Overall, our results suggest that an ESCC subtype with EGFR amplification and overexpression benefits from cetuximab treatment, which warrants further clinical confirmation.
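The responder criterion above (ΔT/ΔC < 0) can be illustrated with a short sketch. It assumes the common convention that ΔT and ΔC are the changes in mean tumor volume of the treated and control groups between the start and the end of dosing; the abstract does not spell out the formula, and the function names and volumes below are hypothetical.

```python
def delta_t_over_delta_c(treated_start, treated_end, control_start, control_end):
    """Ratio of tumor volume change in the treated group to that in the control group.

    Assumed convention: all four arguments are mean tumor volumes (e.g. in mm^3).
    A negative ratio means the treated tumors shrank while the controls grew.
    """
    delta_c = control_end - control_start
    if delta_c <= 0:
        # Without control growth the ratio is not interpretable.
        raise ValueError("control tumors must grow for the ratio to be meaningful")
    return (treated_end - treated_start) / delta_c


def is_responder(ratio):
    """Apply the threshold quoted in the abstract: responders have a ratio below 0."""
    return ratio < 0


# Hypothetical volumes, chosen only to illustrate the two outcomes.
regressing = delta_t_over_delta_c(150.0, 120.0, 150.0, 750.0)
growing = delta_t_over_delta_c(150.0, 450.0, 150.0, 750.0)

# The response rate reported in the abstract: 7 of 16 models.
response_rate_percent = round(100 * 7 / 16, 1)
```

Under this convention a model counts as a responder only if its treated tumors regress below their starting volume, which is a stricter bar than slowed growth.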
ABSTRACT
Inter- and intra-tumour molecular heterogeneity is increasingly recognized in clear cell renal cell carcinoma (ccRCC). It may partially explain the diversity of responses to targeted therapies and the various clinical outcomes. In this study, a 56-year-old male ccRCC patient with multiple metastases received radical nephrectomy and resection of the metastatic tumour in the chest wall. The surgical specimens were implanted into nude mice to establish patient-derived xenograft (PDX) models, with the KI2367 model derived from the primary tumour and the KI2368 model from the metastatic tumour. The two models were treated with Sorafenib, Sunitinib, Axitinib, combined Sorafenib/Sunitinib, or alternating therapy of Sorafenib and Sunitinib. Significant anti-tumour activity was found in KI2367 treated with Sorafenib or Sunitinib monotherapy, combined Sorafenib/Sunitinib, and alternating therapy of Sorafenib/Sunitinib (P<0.05), but not in KI2367 treated with Axitinib monotherapy. In contrast, KI2368 was significantly responsive to Sunitinib monotherapy, combined Sorafenib/Sunitinib therapy and alternating therapy of Sorafenib/Sunitinib (P<0.05), but not to Sorafenib or Axitinib monotherapy. RNAseq of the two models demonstrated that the expression levels of 1,725 genes, including the drug target genes PDGFA, PDGFB and PDGFRA, were >5-fold higher in KI2367 than in KI2368, and the expression levels of 994 genes were >5-fold higher in KI2368 than in KI2367. These results suggest the presence of intra-tumour molecular heterogeneity in this patient. This heterogeneity may influence the response to targeted therapies. Multiple biopsies, liquid biopsy and genomic analysis of intra-tumour molecular heterogeneity may help guide a more precise and effective plan for selecting targeted therapies for ccRCC patients.
Subjects
Biomarkers, Tumor , Carcinoma, Renal Cell/genetics , Genetic Heterogeneity , Kidney Neoplasms/genetics , Animals , Antineoplastic Agents/pharmacology , Antineoplastic Agents/therapeutic use , Carcinoma, Renal Cell/drug therapy , Carcinoma, Renal Cell/pathology , Cell Line, Tumor , Disease Models, Animal , Gene Expression , Heterografts , High-Throughput Nucleotide Sequencing , Humans , Kidney Neoplasms/drug therapy , Kidney Neoplasms/pathology , Mice , Molecular Targeted Therapy , Oncogene Proteins, Fusion/genetics , Sequence Analysis, RNA , Tumor Burden/drug effects , Tumor Burden/genetics
ABSTRACT
The Cancer Genome Atlas (TCGA) project has generated abundant genomic data for human cancers of various histopathology types and has enabled the exploration of cancer molecular pathology with a big-data approach. We developed a new algorithm that uses the most differentially expressed genes (DEGs) from pairwise comparisons to calculate correlation coefficients that quantify similarity within and between cancer types. We systematically compared TCGA cancers, demonstrating high correlation within types and low correlation between types, thus establishing the molecular specificity of cancer types and an alternative diagnostic method largely equivalent to histopathology. The different coefficients for the cancers studied may indicate that the degree of within-type homogeneity varies by cancer type. We also performed the same calculation using the TCGA-derived DEGs on patient-derived xenografts (PDXs) of different histopathology types corresponding to the TCGA types, as well as on cancer cell lines. We demonstrated, for the first time in a systematic study, highly similar patterns of within- and between-type correlation between PDXs and patient samples, confirming the high relevance of PDXs as surrogate experimental models for human diseases. In contrast, cancer cell lines have drastically reduced expression similarity to both PDXs and patient samples. The studies also revealed high similarity between some types, for example, LUSC and HNSCC, but low similarity between certain subtypes, for example, LUAD and LUSC. Our newly developed algorithm seems to be a practical diagnostic method to classify and reclassify a disease, either human or xenograft, with better accuracy than traditional histopathology. Cancer Res; 76(16); 4619-26. ©2016 AACR.
Subjects
Algorithms , Cell Line, Tumor , Heterografts , Neoplasms/genetics , Transcriptome , Animals , Computational Biology , Gene Expression Profiling , Humans , Mice , Oligonucleotide Array Sequence Analysis , Pathology, Molecular/methods
ABSTRACT
Cetuximab is a standard of care for treating EGFR-expressing metastatic colorectal carcinoma (mCRC), excluding patients with KRAS mutations at codons 12/13. However, retrospective analysis has recently suggested that KRAS-G13D patients can still benefit from the treatment, while only a fraction of KRAS wild-type patients do. We set out to test this contradictory issue experimentally in an independent cohort of patient-derived xenograft (PDX) models. We conducted a mouse clinical trial (MCT) enrolling a random cohort of 27 transcriptome-sequenced CRC PDXs to evaluate cetuximab activity. The treatment responses were analyzed against the KRAS 12/13 mutation alleles, as well as several other well-known oncogenic alleles. If response is defined as >80% tumor growth inhibition, 8/27 PDXs (~30%) are responders versus 19/27 non-/partial responders (~70%). We found that KRAS-12/13 alleles are indeed not significantly less frequent among responders (4/8, or 50%) than among non-/partial responders (7/19, or 37%). In particular, G13D is statistically no less frequent among responders (4/8, or 50%) than among non-/partial responders (2/19, or 10.5%). Furthermore, the majority of the non-/partial responders tend to carry certain activating oncogenic alleles (one or more of the following common ones: K/N-RAS-G12V/D, -A146T, -Q61H/R, BRAF-V600E, AKT1-L52R and PIK3CA-E545G/K). Our data from an independent cohort support the recent clinical observation but argue against the currently practiced patient stratification in cetuximab treatment of CRC. Meanwhile, our data seem to suggest that a set of six oncogenic alleles may have better predictive value than the currently practiced stratification, justifying a new prospective clinical investigation in an independent cohort for confirmation.
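The allele counts above form a 2x2 table (G13D carriers versus non-carriers, responders versus non-/partial responders). The abstract does not name the statistical test that was used, so the following is only a hedged sketch of one standard option, a one-sided Fisher's exact test computed from the hypergeometric distribution. The function name is hypothetical, and only the counts quoted above (4/8 and 2/19) are used.

```python
from math import comb


def hypergeom_upper_tail(k, group_size, carriers, total):
    """P(X >= k) for a hypergeometric draw.

    X is the number of allele carriers among `group_size` models drawn
    without replacement from `total` models, `carriers` of which carry
    the allele. This is the one-sided Fisher's exact p-value for
    enrichment of carriers in the group.
    """
    denominator = comb(total, group_size)
    upper = min(group_size, carriers)
    numerator = sum(
        comb(carriers, i) * comb(total - carriers, group_size - i)
        for i in range(k, upper + 1)
    )
    return numerator / denominator


# Counts from the abstract: 4 of 8 responders and 2 of 19
# non-/partial responders carry G13D, so 6 carriers among 27 models.
p_g13d_enriched_in_responders = hypergeom_upper_tail(
    k=4, group_size=8, carriers=4 + 2, total=27
)
```

With a cohort of 27 models an exact test is a natural choice, since the expected counts in several cells are too small for a chi-square approximation.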
Subjects
Biomarkers, Tumor/genetics , Cetuximab/therapeutic use , Colorectal Neoplasms/pathology , Drug Resistance, Neoplasm/genetics , Mutation/genetics , Proto-Oncogene Proteins p21(ras)/genetics , Proto-Oncogene Proteins/genetics , Alleles , Animals , Antineoplastic Agents/therapeutic use , Colorectal Neoplasms/drug therapy , Colorectal Neoplasms/genetics , Gene Expression Regulation, Neoplastic , High-Throughput Nucleotide Sequencing/methods , Humans , Mice , Retrospective Studies , Tumor Cells, Cultured , Xenograft Model Antitumor Assays
ABSTRACT
The duck (Anas platyrhynchos) is one of the principal natural hosts of influenza A viruses. We present the duck genome sequence and perform deep transcriptome analyses to investigate immune-related genes. Our data indicate that the duck possesses a contractive immune gene repertoire, as in chicken and zebra finch, and this repertoire has been shaped through lineage-specific duplications. We identify genes that are responsive to influenza A viruses using the lung transcriptomes of control ducks and ones that were infected with either a highly pathogenic (A/duck/Hubei/49/05) or a weakly pathogenic (A/goose/Hubei/65/05) H5N1 virus. Further, we show how the duck's defense mechanisms against influenza infection have been optimized through the diversification of its β-defensin and butyrophilin-like repertoires. These analyses, in combination with the genomic and transcriptomic data, provide a resource for characterizing the interaction between host and influenza viruses.
Subjects
Disease Reservoirs , Ducks/genetics , Ducks/virology , Genome , Influenza in Birds/genetics , Transcriptome/genetics , Animals , Base Sequence , Chickens/genetics , Disease Vectors , Ducks/immunology , Female , Geese/genetics , Genome/physiology , Host-Pathogen Interactions/genetics , Host-Pathogen Interactions/immunology , Immunity/genetics , Influenza in Birds/immunology , Molecular Sequence Data , Phylogeny , Species Specificity
ABSTRACT
Domestic yaks (Bos grunniens) provide meat and other necessities for Tibetans living at high altitude on the Qinghai-Tibetan Plateau and in adjacent regions. Comparison between yak and the closely related low-altitude cattle (Bos taurus) is informative in studying animal adaptation to high altitude. Here, we present the draft genome sequence of a female domestic yak generated using Illumina-based technology at 65-fold coverage. Genomic comparisons between yak and cattle identify an expansion in yak of gene families related to sensory perception and energy metabolism, as well as an enrichment of protein domains involved in sensing the extracellular environment and hypoxic stress. Positively selected and rapidly evolving genes in the yak lineage are also found to be significantly enriched in functional categories and pathways related to hypoxia and nutrition metabolism. These findings may have important implications for understanding adaptation to high altitude in other animal species and for hypoxia-related diseases in humans.
Subjects
Acclimatization/genetics , Altitude , Cattle/genetics , Cattle/physiology , Animals , Base Sequence , DNA/genetics , Evolution, Molecular , Female , Genome , Molecular Sequence Data , Multigene Family , Phylogeny , Selection, Genetic , Species Specificity
ABSTRACT
Here we integrate the de novo assembly of an Asian and an African genome with the NCBI reference human genome, as a step toward constructing the human pan-genome. We identified approximately 5 Mb of novel sequences not present in the reference genome in each of these assemblies. Most novel sequences are individual or population specific, as revealed by their comparison to all available human DNA sequence and by PCR validation using the human genome diversity cell line panel. We found novel sequences present in patterns consistent with known human migration paths. Cross-species conservation analysis of predicted genes indicated that the novel sequences contain potentially functional coding regions. We estimate that a complete human pan-genome would contain approximately 19-40 Mb of novel sequence not present in the extant reference genome. The extensive amount of novel sequence contributing to the genetic variation of the pan-genome indicates the importance of using complete genome sequencing and de novo assembly.
Subjects
Genome, Human/genetics , Sequence Analysis, DNA/methods , Animals , Base Sequence , Genetics, Population , Humans , Sequence Alignment , Species Specificity
ABSTRACT
Cucumber is an economically important crop as well as a model system for sex determination studies and plant vascular biology. Here we report the draft genome sequence of Cucumis sativus var. sativus L., assembled using a novel combination of traditional Sanger and next-generation Illumina GA sequencing technologies to obtain 72.2-fold genome coverage. The absence of recent whole-genome duplication, along with the presence of few tandem duplications, explains the small number of genes in the cucumber. Our study establishes that five of the cucumber's seven chromosomes arose from fusions of ten ancestral chromosomes after divergence from Cucumis melo. The sequenced cucumber genome affords insight into traits such as its sex expression, disease resistance, biosynthesis of cucurbitacin and 'fresh green' odor. We also identify 686 gene clusters related to phloem function. The cucumber genome provides a valuable resource for developing elite cultivars and for studying the evolution and function of the plant vascular system.
Subjects
Cucumis sativus/genetics , Genome, Plant , DNA Transposable Elements/genetics , DNA, Plant/chemistry , Gene Duplication , Genes, Plant , Immunity, Innate/genetics , Molecular Sequence Data , Plant Diseases/genetics , Repetitive Sequences, Nucleic Acid , Synteny
ABSTRACT
Cholera, caused by Vibrio cholerae, erupted globally from South Asia in 7 pandemics, but there were also local outbreaks between the 6th (1899-1923) and 7th (1961-present) pandemics. All the above are serotype O1, whereas environmental or invertebrate isolates are antigenically diverse. The pre-7th pandemic isolates mentioned above, and other minor pathogenic clones, are related to the 7th pandemic clone, while the 6th pandemic clone is in the same lineage but more distantly related, and non-pathogenic isolates show no clonal structure. To understand the origins and relationships of the pandemic clones, we sequenced the genomes of a 1937 prepandemic strain and a 6th pandemic isolate, and compared them with the published 7th pandemic genome. We distinguished mutational and recombinational events, and allocated these and other events to specific branches in the evolutionary tree. There were more mutational than recombinational events, but more genes, and 44 times more base pairs, changed by recombination. We used the mutational single-nucleotide polymorphisms and known isolation dates of the prepandemic and 7th pandemic isolates to estimate the mutation rate, and found it to be 100-fold higher than usually assumed. We then used this to estimate the divergence date of the 6th and 7th pandemic clones to be about 1880. While there is a large margin of error, this is far more realistic than the 10,000-50,000 years ago estimated using the usual assumptions. We conclude that the 2 pandemic clones gained pandemic potential independently, and overall there were 29 insertions or deletions of one or more genes. There were also substantial changes in the major integron, attributed to gain of individual cassettes including copying from within, or loss of blocks of cassettes. The approaches used open up new avenues for analysing the origin and history of other important pathogens.
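The dating logic described above (estimate a mutation rate from isolates with known isolation dates, then convert branch-specific SNP counts into a divergence date) can be sketched under a strict molecular clock. This is a simplification for illustration, not the authors' actual procedure. The abstract reports no SNP counts, so the numbers below are hypothetical placeholders, and the function names are invented for this example.

```python
def snps_per_year(extra_snps, year_early, year_late):
    """Estimate a genome-wide mutation rate from two dated isolates on one lineage.

    `extra_snps` is the number of mutational SNPs the later isolate has
    accumulated beyond the earlier one (recombinational changes excluded).
    """
    elapsed = year_late - year_early
    if elapsed <= 0:
        raise ValueError("the later isolate must postdate the earlier one")
    return extra_snps / elapsed


def divergence_year(snps_on_branch, rate, sampling_year):
    """Date the ancestor of a lineage by walking back along its branch.

    Strict-clock assumption: the branch from the common ancestor to an
    isolate sampled in `sampling_year` gained `snps_on_branch` SNPs at a
    constant `rate` (SNPs per genome per year).
    """
    return sampling_year - snps_on_branch / rate


# Hypothetical inputs, chosen only to show the arithmetic.
rate = snps_per_year(extra_snps=30, year_early=1940, year_late=2000)
ancestor_year = divergence_year(snps_on_branch=50, rate=rate, sampling_year=2000)
```

Because the estimated date scales inversely with the rate, a rate 100-fold higher than the usual assumption shortens the inferred divergence time by the same factor, which is consistent with the shift from tens of thousands of years to a date in the nineteenth century.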