RESUMO
Alternative polyadenylation increases transcript diversities at the 3' end, regulating biological processes including cell differentiation, embryonic development and cancer progression. Here, we present a Bayesian method SCAPE, which enables de novo identification and quantification of polyadenylation (pA) sites at single-cell level by utilizing insert size information. We demonstrated its accuracy and robustness and identified 31 558 sites from 36 mouse organs, 43.8% (13 807) of which were novel. We illustrated that APA isoforms were associated with miRNAs binding and regulated in tissue-, cell type-and tumor-specific manners where no difference was found at gene expression level, providing an extra layer of information for cell clustering. Furthermore, we found genome-wide dynamic changes of APA usage during erythropoiesis and induced pluripotent stem cell (iPSC) differentiation, suggesting APA contributes to the functional flexibility and diversity of single cells. We expect SCAPE to aid the analyses of cellular dynamics and diversities in health and disease.
Assuntos
Células-Tronco Pluripotentes Induzidas , MicroRNAs , Regiões 3' não Traduzidas/genética , Animais , Teorema de Bayes , Diferenciação Celular/genética , Células-Tronco Pluripotentes Induzidas/metabolismo , Camundongos , MicroRNAs/genética , MicroRNAs/metabolismo , PoliadenilaçãoRESUMO
The development of DNA barcoding method has given rise to a promising way of studying genetic taxonomy. Our previous study showed that pyrosequencing profile of 18S rDNA V7 hypervariable region can be used for identifying water sources without resolving the exact components of diatom colonies in water samples. In this continued study, we aimed to improve the established analysis method and to provide scientific evidence for forensic practices. A drowning animal model was set up by injecting mimic drowning fluid into the respiratory tract of the rabbit. In order to minimize the interference of animal DNA, the hypervariable region of chloroplast ribulose-1,5-bisphosphate carboxylase large unit gene (rbcL) was used as the pyrosequencing target region for the consistency analysis of plankton populations in tissues and water samples. After decoding the pyrosequencing profile of the targeted rbcL gene with the AdvISER-M-PYRO algorithm, the plankton colony that was inhaled into drowning animal lung tissue could be successfully traced back to the source of drowning fluid. Our data suggest that this method could be a reliable tool assisting forensic drowning site inference.
Assuntos
Código de Barras de DNA Taxonômico , Diatomáceas/classificação , Afogamento/patologia , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Ribulose-Bifosfato Carboxilase/genética , Animais , Genética Forense , Patologia Legal , Modelos Animais , Reação em Cadeia da Polimerase , Coelhos , Sensibilidade e EspecificidadeRESUMO
Insertion/deletion polymorphisms (InDels), which possess the characteristics of low mutation rates and a short amplicon size, have been regarded as promising markers for forensic DNA analysis. InDels can be classified as bi-allelic or multi-allelic, depending on the number of alleles. Many studies have explored the use of bi-allelic InDels in forensic applications, such as individual identification and ancestry inference. However, multi-allelic InDels have received relatively little attention. In this study, InDels with 2-6 alleles and a minor allele frequency ≥0.01, in Chinese Southern Han (CHS), were retrieved from the 1000 Genomes Project Phase III. Based on the structural analysis of all retrieved InDels, 17 multi-allelic markers with mononucleotide homopolymer structures were selected and combined in one multiplex PCR reaction system. Sensitivity, species specificity and applicability in forensic case work of the multiplex were analyzed. A total of 218 unrelated individuals from a Chinese Han population were genotyped. The combined discriminatory power (CDP), the combined match probability (CMP) and the cumulative probability of exclusion (CPE) were 0.9999999999609, 3.91E-13 and 0.9956, respectively. The results demonstrated that this InDel multiplex panel was highly informative in the investigated population and most of the 26 populations of the 1000 Genomes Project. The data also suggested that multi-allelic InDel markers with monomeric base pair expansions are useful for forensic applications.
Assuntos
Alelos , Genética Forense/métodos , Mutação INDEL , Povo Asiático/genética , DNA/análise , Marcadores Genéticos , Genótipo , HumanosRESUMO
Diatom examination has always been used for the diagnosis of drowning in forensic practice. However, traditional examination of the microscopic features of diatom frustules is time-consuming and requires taxonomic expertise. In this study, we demonstrate a potential DNA-based method of inferring suspected drowning site using pyrosequencing (PSQ) of the V7 region of 18S ribosome DNA (18S rDNA) as a diatom DNA barcode. By employing a sparse representation-based AdvISER-M-PYRO algorithm, the original PSQ signals of diatom DNA mixtures were deciphered to determine the corresponding taxa of the composite diatoms. Additionally, we evaluated the possibility of correlating water samples to collection sites by analyzing the PSQ signal profiles of diatom mixtures contained in the water samples via multidimensional scaling. The results suggest that diatomaceous PSQ profile analysis could be used as a cost-effective method to deduce the geographical origin of an environmental bio-sample.
Assuntos
Código de Barras de DNA Taxonômico , DNA Ribossômico/genética , Análise de Sequência de DNA , Diatomáceas , Afogamento/diagnóstico , HumanosRESUMO
Visual information may convey different affective valences and induce our brain into different affective perceptions. Many studies have found that unpleasant stimuli could produce stronger emotional effects than pleasant stimuli could. Although there has been a notion that triangle is perceived as negative and circle as positive, there has been no systematic study to map the degrees of valence of shapes with different affective perceptions. Here, we employed four shapes (ellipse, triangle, and line-drawn happy and angry faces) to investigate the behavior and electrophysiological responses, in order to systematically study shape-induced affective perception. The reaction time delay and the event-related potential (ERP), particularly the early ERP component, were applied to find the associations with different affective perceptions. Our behavioral results showed that reaction time for angry face was significantly shorter than those for the other three types of stimuli (p < 0.05). In the ERP results, P1, N1, P2, and N2 amplitudes for angry face were significantly larger than those for happy face. Similarly, P1, N1, P2, and N2 amplitudes for triangle were significantly larger than those for ellipse. Particularly, P1 amplitude in the parietal lobe for angry face was the strongest, followed by happy face, triangle, and ellipse. Hence, this work found distinct electrophysiological evidence to map the shape-induced affective perception. It supports the hypothesis that affective strain would induce larger amplitude than affective ease does and strong affective stimuli induce larger amplitude than mild affective stimuli do.
Assuntos
Afeto/fisiologia , Encéfalo/fisiologia , Reconhecimento Visual de Modelos/fisiologia , Adulto , Eletroencefalografia , Potenciais Evocados Visuais , Feminino , Humanos , Masculino , Tempo de Reação , Adulto JovemRESUMO
OBJECTIVE: To assess the association of programmed cell death 1 (PDCD1) gene polymorphisms with the susceptibility and/or progression of colorectal cancer. METHODS: A hospital-based case-control study was carried out, which recruited 426 colorectal cancer patients and 500 healthy individuals. Five single nucleotide polymorphisms, namely rs36084323, rs11568821, rs2227981, rs2227982 and rs10204525, were selected for the study and genotyped with a polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) assay. RESULTS: The G allele of rs36084323 under a dominant model was associated with increased risk of advanced TNM staging of colorectal cancer progression (OR=1.59, 95%CI=1.02-2.48). Haplotypes G-G-C-T-A and A-G-C-C-G of the rs36084323, rs11568821, rs2227981, rs2227982, and rs10204525 were negatively associated with the occurrence of colorectal cancer. CONCLUSION: The G allele of rs36084323 is associated with increased risk of advanced TNM staging of colorectal cancer. Conversely, the incidence of colorectal cancer is negatively associated with the haplotypes G-G-C-T-A and A-G-C-C-G of rs36084323, rs11568821, rs2227981, rs2227982, and rs10204525.
Assuntos
Neoplasias Colorretais/genética , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único , Receptor de Morte Celular Programada 1/genética , Povo Asiático/genética , Estudos de Casos e Controles , China/etnologia , Neoplasias Colorretais/patologia , Haplótipos , Humanos , Estadiamento de NeoplasiasRESUMO
During brain development, neural stem cells (NSCs) undergo multiple fate-switches to generate various neuronal subtypes and glial cells, exhibiting distinct transcriptomic profiles at different stages. However, full-length transcriptomic datasets of NSCs across different neurodevelopmental stages under similar experimental settings are lacking, which is essential for uncovering stage-specific transcriptional and post-transcriptional mechanisms underlying the fate commitment of NSCs. Here, we report the full-length transcriptome of mouse NSCs at five different stages during embryonic and postnatal development. We used fluorescent-activated cell sorting (FACS) to isolate CD133+Blbp+ NSCs from C57BL/6 transgenic mice that express enhanced green fluorescent protein (EGFP) under the control of a Blbp promoter. By integrating short- and long-read full-length RNA-seq, we created a transcriptomic dataset of gene and isoform expression profiles in NSCs at embryonic days 15.5, 17.5, and postnatal days 1.5, 8, and 60. This dataset provides a detailed characterization of full-length transcripts in NSCs at distinct developmental stages, which could be used as a resource for the neuroscience community to study NSC fate determination, neural development, and disease.
Assuntos
Células-Tronco Neurais , Transcriptoma , Animais , Diferenciação Celular , Camundongos , Camundongos Endogâmicos C57BL , Células-Tronco Neurais/metabolismo , NeurogêneseRESUMO
Lung adenocarcinoma (LUAD) and squamous carcinoma (LUSC) are two major subtypes of non-small cell lung cancer with distinct pathologic features and treatment paradigms. The heterogeneity can be attributed to genetic, transcriptional, and epigenetic parameters. Here, we established a multi-omics atlas, integrating 52 single-cell RNA sequencing and 2342 public bulk RNA sequencing. We investigated their differences in genetic amplification, cellular compositions, and expression modules. We revealed that LUAD and LUSC contained amplifications occurring selectively in subclusters of AT2 and basal cells, and had distinct cellular composition modules associated with poor survival of lung cancer. Malignant and stage-specific gene analyses further uncovered critical transcription factors and genes in tumor progression. Moreover, we identified subclusters with proliferating and differentiating properties in AT2 and basal cells. Overexpression assays of ten genes, including sub-cluster markers AQP5 and KPNA2, further indicated their functional roles, providing potential targets for early diagnosis and treatment in lung cancer.
Assuntos
Adenocarcinoma de Pulmão , Biomarcadores Tumorais , Regulação Neoplásica da Expressão Gênica , Neoplasias Pulmonares , Análise de Sequência de RNA , Análise de Célula Única , Transcrição Gênica , Adenocarcinoma de Pulmão/genética , Adenocarcinoma de Pulmão/metabolismo , Biomarcadores Tumorais/biossíntese , Biomarcadores Tumorais/genética , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/metabolismoRESUMO
An amplification-based single-molecule fluorescence in situ hybridization (asmFISH) assay is introduced that exploits improved probe design for highly specific imaging of individual transcripts in fixed cells and tissues. In this method, a pair of DNA ligation probes are ligated on RNA templates upon specific hybridization, followed by probe circularization based on enzymatic DNA ligation and rolling circle amplification for signal boosting. The method is more efficient and specific than the padlock probe assay for detection of the same RNA molecules and discrimination of single nucleotide polymorphisms. Moreover, asmFISH is a versatile method which can be applied not only to cultured cells, but also to fresh frozen and formalin-fixed, paraffin-embedded tissue sections.
Assuntos
Sondas de DNA/química , Hibridização in Situ Fluorescente , RNA/análise , Animais , Células Cultivadas , Humanos , Camundongos , Camundongos Endogâmicos C57BL , Polimorfismo de Nucleotídeo Único/genética , RNA/genéticaRESUMO
The SARS-CoV-2 virus, the causative agent of COVID-19, is undergoing constant mutation. Here, we utilized an integrative approach combining epidemiology, virus genome sequencing, clinical phenotyping, and experimental validation to locate mutations of clinical importance. We identified 35 recurrent variants, some of which are associated with clinical phenotypes related to severity. One variant, containing a deletion in the Nsp1-coding region (Δ500-532), was found in more than 20% of our sequenced samples and associates with higher RT-PCR cycle thresholds and lower serum IFN-ß levels of infected patients. Deletion variants in this locus were found in 37 countries worldwide, and viruses isolated from clinical samples or engineered by reverse genetics with related deletions in Nsp1 also induce lower IFN-ß responses in infected Calu-3 cells. Taken together, our virologic surveillance characterizes recurrent genetic diversity and identified mutations in Nsp1 of biological and clinical importance, which collectively may aid molecular diagnostics and drug design.
Assuntos
COVID-19/imunologia , COVID-19/virologia , Interferon Tipo I/imunologia , SARS-CoV-2/genética , SARS-CoV-2/imunologia , Proteínas não Estruturais Virais/genética , Células A549 , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Animais , Sequência de Bases , COVID-19/sangue , Linhagem Celular , Criança , Pré-Escolar , Chlorocebus aethiops , Feminino , Deleção de Genes , Genômica , Células HEK293 , Humanos , Lactente , Interferon Tipo I/sangue , Interferon beta/sangue , Interferon beta/metabolismo , Masculino , Pessoa de Meia-Idade , Epidemiologia Molecular , Genética Reversa , Células Vero , Proteínas não Estruturais Virais/imunologia , Adulto JovemRESUMO
Malaria is a devastating infectious disease and the immune response is complex and dynamic during a course of a malarial infection. Rodent malaria models allow detailed time-series studies of the host response in multiple organs. Here, we describe two comprehensive datasets containing host transcriptomic data from both the blood and spleen throughout an acute blood stage infection of virulent or avirulent Plasmodium chabaudi infection in C57BL/6 mice. The mRNA expression profiles were generated using Illumina BeadChip microarray. These datasets provide a groundwork for comprehensive and comparative studies on host gene expression in early, acute and recovering phases of a blood stage infection in both the blood and spleen, to explore the interaction between the two, and importantly to investigate whether these responses differ in virulent and avirulent infections.
Assuntos
Sangue/metabolismo , Malária/metabolismo , Baço/metabolismo , Transcriptoma , Animais , Sangue/parasitologia , Modelos Animais de Doenças , Camundongos , Camundongos Endogâmicos C57BL , RNA Mensageiro , Baço/parasitologiaRESUMO
Conventional flow cytometry has been widely used for high-throughput single-cell gene expression analysis using specific antibody staining. However, this is limited by the availability of high-quality antibodies. We developed a novel flow cytometry RNA detection technique termed RCA-Flow for single-cell RNA expression analysis. We showed that it is able to analyze not only mRNAs but also microRNAs and circular RNAs that are otherwise difficult to analyze by other flow cytometry techniques. The versatility for high-throughput analysis of different types of RNA molecules makes our method possess great potential for both biomedical and clinical applications.
Assuntos
MicroRNAs , Citometria de Fluxo , MicroRNAs/genética , RNA Mensageiro , Análise de Célula ÚnicaRESUMO
Insertion/deletion polymorphisms (indels), which combine the advantages of both short tandem repeats and single-nucleotide polymorphisms, are suitable for parentage testing. To overcome the limitations of the low polymorphism of di-allelic indels, we constructed a set of haplotypes with physically linked, multi-allelic indels. Candidate haplotypes were selected from the 1000 Genomes Project database, and were subject to the following criteria for inclusion: (i) each marker must have a minimum allele frequency (MAF) of ≥0.1 in the Han population of China; (ii) markers must exist in a non-coding region; (iii) the physical distance between a pair of candidate indels must be <500â¯bp; (iv) the allele length variation of each indel from 1 to 20â¯bp; (v) different haplotypes must be located on different chromosomes or chromosomal arms, or be more than 10 Mb apart if on the same chromosomal arm; and (vi) they must not be located across a recombination hotspot. A multiplex system with 11 haplotype markers, comprising 22 tri-allelic indel loci distributed over 10 chromosomes was developed. To validate the multiplex panel, we investigated the haplotype distribution in sets of two and three-generation pedigrees. The results demonstrated that the haplotypes consisting of multi-allelic indel markers exhibited higher polymorphism than a single indel locus, and thus provide Supplementary information for forensic kinship identification.
Assuntos
Genética Forense/métodos , Haplótipos , Mutação INDEL , Alelos , Eletroforese Capilar , Frequência do Gene , Genótipo , Humanos , Reação em Cadeia da PolimeraseRESUMO
Current research focusing on forensic personal identification, phenotype inference and ancestry information on single-nucleotide polymorphisms (SNPs) has been widely reported. In the present study, we focused on tetra-allelic SNPs in the Chinese Han population. A total of 48 tetra-allelic SNPs were screened out from the Chinese Han population of the 1000 Genomes Database, including Chinese Han in Beijing (CHB) and Chinese Han South (CHS). Considering the forensic genetic requirement for the polymorphisms, only 11 tetra-allelic SNPs with a heterozygosity >0.06 were selected for further multiplex panel construction. In order to meet the demands of personal identification and parentage identification, an additional 8 tri-allelic SNPs were combined into the final multiplex panel. To ensure application in the degraded DNA analysis, all the PCR products were designed to be 87-188â¯bp. Employing multiple PCR reactions and SNaPshot minisequencing, 511 unrelated Chinese Han individuals from Sichuan were genotyped. The combined match probability (CMP), combined discrimination power (CDP), and cumulative probability of exclusion (CPE) of the panel were 6.07â¯×â¯10-11, 0.9999999999393 and 0.996764, respectively. Based on the population data retrieved from the 1000 Genomes Project, Fst values between Chinese Han in Sichuan (SCH) and all the populations included in the 1000 Genomes Project were calculated. The results indicated that two SNPs in this panel may contain ancestry information and may be used as markers of forensic biogeographical ancestry inference.