Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 72
Filtrar
1.
Adv Mater Technol ; 9(2)2024 Jan 22.
Artículo en Inglés | MEDLINE | ID: mdl-38645306

RESUMEN

Single-cell genomics has revolutionized tissue analysis by revealing the genetic program of individual cells. The key aspect of the technology is the use of barcoded beads to unambiguously tag sequences originating from a single cell. The generation of unique barcodes on beads is mainly achieved by split-pooling methods, which are labor-intensive due to repeated washing steps. Towards the automation of the split-pooling method, we developed a simple method to magnetize hydrogel beads. We show that these hydrogel beads provide increased yields and washing efficiencies for purification procedures. They are also fully compatible with single-cell sequencing using the BAG-Seq workflow. Our work opens the automation of the split-pooling technique, which will improve single-cell genomic workflows.

2.
Cell Genom ; 3(6): 100319, 2023 Jun 14.
Artículo en Inglés | MEDLINE | ID: mdl-37388917

RESUMEN

Studying thousands of families, we find siblings concordant for autism share more of their parental genomes than expected by chance, and discordant siblings share less, consistent with a role of transmission in autism incidence. The excess sharing of the father is highly significant (p value of 0.0014), with less significance for the mother (p value of 0.31). To compare parental sharing, we adjust for differences in meiotic recombination to obtain a p value of 0.15 that they are shared equally. These observations are contrary to certain models in which the mother carries a greater load than the father. Nevertheless, we present models in which greater sharing of the father is observed even though the mother carries a greater load. More generally, our observations of sharing establish quantitative constraints that any complete genetic model of autism must satisfy, and our methods may be applicable to other complex disorders.

3.
Nucleic Acids Res ; 50(20): e116, 2022 11 11.
Artículo en Inglés | MEDLINE | ID: mdl-36095132

RESUMEN

Tandem repeats of simple sequence motifs, also known as microsatellites, are abundant in the genome. Because their repeat structure makes replication error-prone, variant microsatellite lengths are often generated during germline and other somatic expansions. As such, microsatellite length variations can serve as markers for cancer. However, accurate error-free measurement of microsatellite lengths is difficult with current methods precisely because of this high error rate during amplification. We have solved this problem by using partial mutagenesis to disrupt enough of the repeat structure of initial templates so that their sequence lengths replicate faithfully. In this work, we use bisulfite mutagenesis to convert a C to a U, later read as T. Compared to untreated templates, we achieve three orders of magnitude reduction in the error rate per round of replication. By requiring agreement from two independent first copies of an initial template, we reach error rates below one in a million. We apply this method to a thousand microsatellite loci from the human genome, revealing microsatellite length distributions not observable without mutagenesis.


Asunto(s)
Genoma Humano , Repeticiones de Microsatélite , Mutagénesis Sitio-Dirigida , Humanos , Repeticiones de Microsatélite/genética , Mutagénesis Sitio-Dirigida/métodos
4.
Nucleic Acids Res ; 50(18): e103, 2022 10 14.
Artículo en Inglés | MEDLINE | ID: mdl-35822882

RESUMEN

Short-read sequencers provide highly accurate reads at very low cost. Unfortunately, short reads are often inadequate for important applications such as assembly in complex regions or phasing across distant heterozygous sites. In this study, we describe novel bench protocols and algorithms to obtain haplotype-phased sequence assemblies with ultra-low error for regions 10 kb and longer using short reads only. We accomplish this by imprinting each template strand from a target region with a dense and unique mutation pattern. The mutation process randomly and independently converts ∼50% of cytosines to uracils. Sequencing libraries are made from both mutated and unmutated templates. Using de Bruijn graphs and paired-end read information, we assemble each mutated template and use the unmutated library to correct the mutated bases. Templates are partitioned into two or more haplotypes, and the final haplotypes are assembled and corrected for residual template mutations and PCR errors. With sufficient template coverage, the final assemblies have per-base error rates below 10-9. We demonstrate this method on a four-member nuclear family, correctly assembling and phasing three genomic intervals, including the highly polymorphic HLA-B gene.


Asunto(s)
Genoma , Genómica , Algoritmos , Antígenos HLA-B , Haplotipos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Mutagénesis , Análisis de Secuencia de ADN/métodos
5.
Commun Biol ; 4(1): 1026, 2021 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-34471188

RESUMEN

Autism arises in high and low-risk families. De novo mutation contributes to autism incidence in low-risk families as there is a higher incidence in the affected of the simplex families than in their unaffected siblings. But the extent of contribution in low-risk families cannot be determined solely from simplex families as they are a mixture of low and high-risk. The rate of de novo mutation in nearly pure populations of high-risk families, the multiplex families, has not previously been rigorously determined. Moreover, rates of de novo mutation have been underestimated from studies based on low resolution microarrays and whole exome sequencing. Here we report on findings from whole genome sequence (WGS) of both simplex families from the Simons Simplex Collection (SSC) and multiplex families from the Autism Genetic Resource Exchange (AGRE). After removing the multiplex samples with excessive cell-line genetic drift, we find that the contribution of de novo mutation in multiplex is significantly smaller than the contribution in simplex. We use WGS to provide high resolution CNV profiles and to analyze more than coding regions, and revise upward the rate in simplex autism due to an excess of de novo events targeting introns. Based on this study, we now estimate that de novo events contribute to 52-67% of cases of autism arising from low risk families, and 30-39% of cases of all autism.


Asunto(s)
Trastorno Autístico/epidemiología , Predisposición Genética a la Enfermedad/genética , Mutación , Adulto , Trastorno del Espectro Autista , Trastorno Autístico/genética , Femenino , Humanos , Incidencia , Masculino , Persona de Mediana Edad , New York/epidemiología , Factores de Riesgo , Adulto Joven
6.
Cancer Discov ; 10(10): 1566-1589, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-32703770

RESUMEN

Pancreatic ductal adenocarcinoma (PDAC) is the most lethal common malignancy, with little improvement in patient outcomes over the past decades. Recently, subtypes of pancreatic cancer with different prognoses have been elaborated; however, the inability to model these subtypes has precluded mechanistic investigation of their origins. Here, we present a xenotransplantation model of PDAC in which neoplasms originate from patient-derived organoids injected directly into murine pancreatic ducts. Our model enables distinction of the two main PDAC subtypes: intraepithelial neoplasms from this model progress in an indolent or invasive manner representing the classical or basal-like subtypes of PDAC, respectively. Parameters that influence PDAC subtype specification in this intraductal model include cell plasticity and hyperactivation of the RAS pathway. Finally, through intratumoral dissection and the direct manipulation of RAS gene dosage, we identify a suite of RAS-regulated secreted and membrane-bound proteins that may represent potential candidates for therapeutic intervention in patients with PDAC. SIGNIFICANCE: Accurate modeling of the molecular subtypes of pancreatic cancer is crucial to facilitate the generation of effective therapies. We report the development of an intraductal organoid transplantation model of pancreatic cancer that models the progressive switching of subtypes, and identify stochastic and RAS-driven mechanisms that determine subtype specification.See related commentary by Pickering and Morton, p. 1448.This article is highlighted in the In This Issue feature, p. 1426.


Asunto(s)
Adenocarcinoma/genética , Regulación Neoplásica de la Expresión Génica/genética , Conductos Pancreáticos/trasplante , Animales , Carcinoma Ductal Pancreático , Modelos Animales de Enfermedad , Humanos , Ratones , Pronóstico
7.
Elife ; 92020 05 13.
Artículo en Inglés | MEDLINE | ID: mdl-32401198

RESUMEN

Copy number alterations (CNAs) play an important role in molding the genomes of breast cancers and have been shown to be clinically useful for prognostic and therapeutic purposes. However, our knowledge of intra-tumoral genetic heterogeneity of this important class of somatic alterations is limited. Here, using single-cell sequencing, we comprehensively map out the facets of copy number alteration heterogeneity in a cohort of breast cancer tumors. Ou/var/www/html/elife/12-05-2020/backup/r analyses reveal: genetic heterogeneity of non-tumor cells (i.e. stroma) within the tumor mass; the extent to which copy number heterogeneity impacts breast cancer genomes and the importance of both the genomic location and dosage of sub-clonal events; the pervasive nature of genetic heterogeneity of chromosomal amplifications; and the association of copy number heterogeneity with clinical and biological parameters such as polyploidy and estrogen receptor negative status. Our data highlight the power of single-cell genomics in dissecting, in its many forms, intra-tumoral genetic heterogeneity of CNAs, the magnitude with which CNA heterogeneity affects the genomes of breast cancers, and the potential importance of CNA heterogeneity in phenomena such as therapeutic resistance and disease relapse.


Cells in the body remain healthy by tightly preventing and repairing random changes, or mutations, in their genetic material. In cancer cells, however, these mechanisms can break down. When these cells grow and multiply, they can then go on to accumulate many mutations. As a result, cancer cells in the same tumor can each contain a unique combination of genetic changes. This genetic heterogeneity has the potential to affect how cancer responds to treatment, and is increasingly becoming appreciated clinically. For example, if a drug only works against cancer cells carrying a specific mutation, any cells lacking this genetic change will keep growing and cause a relapse. However, it is still difficult to quantify and understand genetic heterogeneity in cancer. Copy number alterations (or CNAs) are a class of mutation where large and small sections of genetic material are gained or lost. This can result in cells that have an abnormal number of copies of the genes in these sections. Here, Baslan et al. set out to explore how CNAs might vary between individual cancer cells within the same tumor. To do so, thousands of individual cancer cells were isolated from human breast tumors, and a technique called single-cell genome sequencing used to screen the genetic information of each of them. These experiments confirmed that CNAs did differ ­ sometimes dramatically ­ between patients and among cells taken from the same tumor. For example, many of the cells carried extra copies of well-known cancer genes important for treatment, but the exact number of copies varied between cells. This heterogeneity existed for individual genes as well as larger stretches of DNA: this was the case, for instance, for an entire section of chromosome 8, a region often affected in breast and other tumors. The work by Baslan et al. captures the sheer extent of genetic heterogeneity in cancer and in doing so, highlights the power of single-cell genome sequencing. In the future, a finer understanding of the genetic changes present at the level of an individual cancer cell may help clinicians to manage the disease more effectively.


Asunto(s)
Biomarcadores de Tumor/genética , Neoplasias de la Mama/genética , Variaciones en el Número de Copia de ADN , Dosificación de Gen , Heterogeneidad Genética , Genómica , Análisis de la Célula Individual , Secuenciación Completa del Genoma , Neoplasias de la Mama/mortalidad , Neoplasias de la Mama/patología , Neoplasias de la Mama/terapia , Ensayos Clínicos Fase II como Asunto , Femenino , Predisposición Genética a la Enfermedad , Humanos , Fenotipo , Pronóstico , RNA-Seq
8.
JCO Clin Cancer Inform ; 4: 464-471, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-32432904

RESUMEN

PURPOSE: Copy-number profiling of multiple individual cells from sparse sequencing may be used to reveal a detailed picture of genomic heterogeneity and clonal organization in a tissue biopsy specimen. We sought to provide a comprehensive computational pipeline for single-cell genomics, to facilitate adoption of this molecular technology for basic and translational research. MATERIALS AND METHODS: The pipeline comprises software tools programmed in Python and in R and depends on Bowtie, HISAT2, Matplotlib, and Qt. It is installed and used with Anaconda. RESULTS: Here we describe a complete pipeline for sparse single-cell genomic data, encompassing all steps of single-nucleus DNA copy-number profiling, from raw sequence processing to clonal structure analysis and visualization. For the latter, a specialized graphical user interface termed the single-cell genome viewer (SCGV) is provided. With applications to cancer diagnostics in mind, the SCGV allows for zooming and linkage to the University of California at Santa Cruz Genome Browser from each of the multiple integrated views of single-cell copy-number profiles. The latter can be organized by clonal substructure or by any of the associated metadata such as anatomic location and histologic characterization. CONCLUSION: The pipeline is available as open-source software for Linux and OS X. Its modular structure, extensive documentation, and ease of deployment using Anaconda facilitate its adoption by researchers and practitioners of single-cell genomics. With open-source availability and Massachusetts Institute of Technology licensing, it provides a basis for additional development by the cancer bioinformatics community.


Asunto(s)
Biología Computacional , Programas Informáticos , Genoma , Genómica , Humanos
9.
Nucleic Acids Res ; 48(7): e40, 2020 04 17.
Artículo en Inglés | MEDLINE | ID: mdl-32083660

RESUMEN

Measuring minimal residual disease in cancer has applications for prognosis, monitoring treatment and detection of recurrence. Simple sequence-based methods to detect nucleotide substitution variants have error rates (about 10-3) that limit sensitive detection. We developed and characterized the performance of MASQ (multiplex accurate sensitive quantitation), a method with an error rate below 10-6. MASQ counts variant templates accurately in the presence of millions of host genomes by using tags to identify each template and demanding consensus over multiple reads. Since the MASQ protocol multiplexes 50 target loci, we can both integrate signal from multiple variants and capture subclonal response to treatment. Compared to existing methods for variant detection, MASQ achieves an excellent combination of sensitivity, specificity and yield. We tested MASQ in a pilot study in acute myeloid leukemia (AML) patients who entered complete remission. We detect leukemic variants in the blood and bone marrow samples of all five patients, after induction therapy, at levels ranging from 10-2 to nearly 10-6. We observe evidence of sub-clonal structure and find higher target variant frequencies in patients who go on to relapse, demonstrating the potential for MASQ to quantify residual disease in AML.


Asunto(s)
Leucemia Mieloide Aguda/genética , Algoritmos , Genómica/métodos , Humanos , Leucemia Mieloide Aguda/terapia , Mutación , Neoplasia Residual , Proyectos Piloto , Recurrencia , Inducción de Remisión , Secuenciación Completa del Genoma
10.
Dev Cell ; 52(4): 413-428.e6, 2020 02 24.
Artículo en Inglés | MEDLINE | ID: mdl-32097652

RESUMEN

High levels of cancer aneuploidy are frequently associated with poor prognosis. To examine the relationship between aneuploidy and cancer progression, we analyzed a series of congenic cell lines that harbor single extra chromosomes. We found that across 13 different trisomic cell lines, 12 trisomies suppressed invasiveness or were largely neutral, while a single trisomy increased metastatic behavior by triggering a partial epithelial-mesenchymal transition. In contrast, we discovered that chromosomal instability activates cGAS/STING signaling but strongly suppresses invasiveness. By analyzing patient copy-number data, we demonstrate that specific aneuploidies are associated with distinct outcomes, and the acquisition of certain aneuploidies is in fact linked with a favorable prognosis. Thus, aneuploidy is not a uniform driver of malignancy, and different aneuploidies can uniquely influence tumor progression. At the same time, the gain of a single chromosome is capable of inducing a profound cell state transition, thereby linking genomic plasticity, phenotypic plasticity, and metastasis.


Asunto(s)
Aneuploidia , Movimiento Celular , Inestabilidad Cromosómica , Cromosomas Humanos Par 5/genética , Neoplasias del Colon/patología , Neoplasias del Colon/prevención & control , Transición Epitelial-Mesenquimal , Animales , Apoptosis , Proliferación Celular , Neoplasias del Colon/genética , Femenino , Humanos , Masculino , Proteínas de la Membrana/genética , Proteínas de la Membrana/metabolismo , Ratones , Ratones Desnudos , Invasividad Neoplásica , Nucleotidiltransferasas/genética , Nucleotidiltransferasas/metabolismo , Células Tumorales Cultivadas , Ensayos Antitumor por Modelo de Xenoinjerto
11.
Genome Res ; 30(1): 49-61, 2020 01.
Artículo en Inglés | MEDLINE | ID: mdl-31727682

RESUMEN

We show the use of 5'-Acrydite oligonucleotides to copolymerize single-cell DNA or RNA into balls of acrylamide gel (BAGs). Combining this step with split-and-pool techniques for creating barcodes yields a method with advantages in cost and scalability, depth of coverage, ease of operation, minimal cross-contamination, and efficient use of samples. We perform DNA copy number profiling on mixtures of cell lines, nuclei from frozen prostate tumors, and biopsy washes. As applied to RNA, the method has high capture efficiency of transcripts and sufficient consistency to clearly distinguish the expression patterns of cell lines and individual nuclei from neurons dissected from the mouse brain. By using varietal tags (UMIs) to achieve sequence error correction, we show extremely low levels of cross-contamination by tracking source-specific SNVs. The method is readily modifiable, and we will discuss its adaptability and diverse applications.


Asunto(s)
Acrilamida , Ácidos Nucleicos , Análisis de la Célula Individual/métodos , Acrilamida/química , ADN , Contaminación de ADN , Variaciones en el Número de Copia de ADN , Dosificación de Gen , Perfilación de la Expresión Génica/métodos , Perfilación de la Expresión Génica/normas , Biblioteca de Genes , Humanos , Neoplasias/genética , Neoplasias/metabolismo , Neoplasias/patología , Ácidos Nucleicos/química , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos/normas , Polimerizacion , ARN , Análisis de la Célula Individual/normas
12.
Proc Natl Acad Sci U S A ; 115(8): E1859-E1866, 2018 02 20.
Artículo en Inglés | MEDLINE | ID: mdl-29434036

RESUMEN

In individuals with autism spectrum disorder (ASD), de novo mutations have previously been shown to be significantly correlated with lower IQ but not with the core characteristics of ASD: deficits in social communication and interaction and restricted interests and repetitive patterns of behavior. We extend these findings by demonstrating in the Simons Simplex Collection that damaging de novo mutations in ASD individuals are also significantly and convincingly correlated with measures of impaired motor skills. This correlation is not explained by a correlation between IQ and motor skills. We find that IQ and motor skills are distinctly associated with damaging mutations and, in particular, that motor skills are a more sensitive indicator of mutational severity than is IQ, as judged by mutational type and target gene. We use this finding to propose a combined classification of phenotypic severity: mild (little impairment of either), moderate (impairment mainly to motor skills), and severe (impairment of both IQ and motor skills).


Asunto(s)
Trastorno del Espectro Autista/genética , Destreza Motora/fisiología , Niño , Femenino , Genotipo , Humanos , Masculino , Mutación
13.
Nucleic Acids Res ; 46(2): e10, 2018 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-29161423

RESUMEN

We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone.


Asunto(s)
ADN/química , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Sulfitos/química , Moldes Genéticos , ADN/genética , Genómica/métodos , Mutación , Reproducibilidad de los Resultados
14.
Cancer Res ; 78(2): 348-358, 2018 01 15.
Artículo en Inglés | MEDLINE | ID: mdl-29180472

RESUMEN

A distinction between indolent and aggressive disease is a major challenge in diagnostics of prostate cancer. As genetic heterogeneity and complexity may influence clinical outcome, we have initiated studies on single tumor cell genomics. In this study, we demonstrate that sparse DNA sequencing of single-cell nuclei from prostate core biopsies is a rich source of quantitative parameters for evaluating neoplastic growth and aggressiveness. These include the presence of clonal populations, the phylogenetic structure of those populations, the degree of the complexity of copy-number changes in those populations, and measures of the proportion of cells with clonal copy-number signatures. The parameters all showed good correlation to the measure of prostatic malignancy, the Gleason score, derived from individual prostate biopsy tissue cores. Remarkably, a more accurate histopathologic measure of malignancy, the surgical Gleason score, agrees better with these genomic parameters of diagnostic biopsy than it does with the diagnostic Gleason score and related measures of diagnostic histopathology. This is highly relevant because primary treatment decisions are dependent upon the biopsy and not the surgical specimen. Thus, single-cell analysis has the potential to augment traditional core histopathology, improving both the objectivity and accuracy of risk assessment and inform treatment decisions.Significance: Genomic analysis of multiple individual cells harvested from prostate biopsies provides an indepth view of cell populations comprising a prostate neoplasm, yielding novel genomic measures with the potential to improve the accuracy of diagnosis and prognosis in prostate cancer. Cancer Res; 78(2); 348-58. ©2017 AACR.


Asunto(s)
Biomarcadores de Tumor/genética , Genómica/métodos , Neoplasias de la Próstata/diagnóstico , Análisis de la Célula Individual/métodos , Anciano , Anciano de 80 o más Años , Humanos , Masculino , Persona de Mediana Edad , Clasificación del Tumor , Estadificación de Neoplasias , Filogenia , Prostatectomía , Neoplasias de la Próstata/genética , Neoplasias de la Próstata/cirugía , Medición de Riesgo
15.
Trends Mol Med ; 23(7): 594-603, 2017 07.
Artículo en Inglés | MEDLINE | ID: mdl-28587830

RESUMEN

Here, we explore the potential of single-cell genomic analysis in blood for early detection of cancer; we consider a method that screens the presence of recurrent patterns of copy number (CN) alterations using sparse single-cell sequencing. We argue for feasibility, based on in silico analysis of existing single-cell data and cancer CN profiles. Sampling procedures from existing diploid single cells can render data for a cell with any given profile. Sampling from multiple published tumor profiles can interrogate cancer clonality via an algorithm that tests the multiplicity of close pairwise similarities among single-cell cancer genomes. The majority of common solid cancers would be detectable in this manner. As any early detection method must be verifiable and actionable, we describe how further analysis of suspect cells can aid in determining risk and anatomic origin. Future affordability rests on currently available procedures for tumor cell enrichment and inexpensive methods for single-cell analysis.


Asunto(s)
Simulación por Computador , Dosificación de Gen , Genoma Humano , Neoplasias , Animales , Humanos , Neoplasias/diagnóstico , Neoplasias/genética
16.
Proc Natl Acad Sci U S A ; 114(27): 7073-7076, 2017 07 03.
Artículo en Inglés | MEDLINE | ID: mdl-28630308

RESUMEN

We develop a method of analysis [affected to discordant sibling pairs (A2DS)] that tests if shared variants contribute to a disorder. Using a standard measure of genetic relation, test individuals are compared with a cohort of discordant sibling pairs (CDS) to derive a comparative similarity score. We ask if a test individual is more similar to an unrelated affected than to the unrelated unaffected sibling from the CDS and then, sum over such individuals and pairs. Statistical significance is judged by randomly permuting the affected status in the CDS. In the analysis of published genotype data from the Simons Simplex Collection (SSC) and the Autism Genetic Resource Exchange (AGRE) cohorts of children with autism spectrum disorder (ASD), we find strong statistical significance that the affected are more similar to the affected than to the unaffected of the CDS (P value ∼ 0.00001). Fathers in multiplex families have marginally greater similarity (P value = 0.02) to unrelated affected individuals. These results do not depend on ethnic matching or gender.


Asunto(s)
Trastorno Autístico/genética , Trastorno Autístico/fisiopatología , Hermanos , Trastorno del Espectro Autista/genética , Trastorno del Espectro Autista/fisiopatología , Niño , Preescolar , Estudios de Cohortes , Simulación por Computador , Salud de la Familia , Femenino , Genotipo , Humanos , Masculino , Modelos Estadísticos , Polimorfismo de Nucleótido Simple , Factores Sexuales
17.
Nat Protoc ; 11(12): 2529-2548, 2016 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-27854363

RESUMEN

As the second most common type of variation in the human genome, insertions and deletions (indels) have been linked to many diseases, but the discovery of indels of more than a few bases in size from short-read sequencing data remains challenging. Scalpel (http://scalpel.sourceforge.net) is an open-source software for reliable indel detection based on the microassembly technique. It has been successfully used to discover mutations in novel candidate genes for autism, and it is extensively used in other large-scale studies of human diseases. This protocol gives an overview of the algorithm and describes how to use Scalpel to perform highly accurate indel calling from whole-genome and whole-exome sequencing data. We provide detailed instructions for an exemplary family-based de novo study, but we also characterize the other two supported modes of operation: single-sample and somatic analysis. Indel normalization, visualization and annotation of the mutations are also illustrated. Using a standard server, indel discovery and characterization in the exonic regions of the example sequencing data can be completed in ∼5 h after read mapping.


Asunto(s)
Análisis Mutacional de ADN/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Mutación INDEL , Alelos , Genómica , Humanos , Anotación de Secuencia Molecular , Polimorfismo de Nucleótido Simple
18.
Genome Res ; 26(6): 844-51, 2016 06.
Artículo en Inglés | MEDLINE | ID: mdl-27197213

RESUMEN

Copy number variants (CNVs) underlie a significant amount of genetic diversity and disease. CNVs can be detected by a number of means, including chromosomal microarray analysis (CMA) and whole-genome sequencing (WGS), but these approaches suffer from either limited resolution (CMA) or are highly expensive for routine screening (both CMA and WGS). As an alternative, we have developed a next-generation sequencing-based method for CNV analysis termed SMASH, for short multiply aggregated sequence homologies. SMASH utilizes random fragmentation of input genomic DNA to create chimeric sequence reads, from which multiple mappable tags can be parsed using maximal almost-unique matches (MAMs). The SMASH tags are then binned and segmented, generating a profile of genomic copy number at the desired resolution. Because fewer reads are necessary relative to WGS to give accurate CNV data, SMASH libraries can be highly multiplexed, allowing large numbers of individuals to be analyzed at low cost. Increased genomic resolution can be achieved by sequencing to higher depth.


Asunto(s)
Dosificación de Gen , Análisis de Secuencia de ADN , Línea Celular Tumoral , Biología Computacional , Variaciones en el Número de Copia de ADN , Femenino , Genoma Humano , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Programas Informáticos
20.
Nat Methods ; 12(11): 1058-60, 2015 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-26344043

RESUMEN

We present Ginkgo (http://qb.cshl.edu/ginkgo), a user-friendly, open-source web platform for the analysis of single-cell copy-number variations (CNVs). Ginkgo automatically constructs copy-number profiles of cells from mapped reads and constructs phylogenetic trees of related cells. We validated Ginkgo by reproducing the results of five major studies. After comparing three commonly used single-cell amplification techniques, we concluded that degenerate oligonucleotide-primed PCR is the most consistent for CNV analysis.


Asunto(s)
Biología Computacional , Variaciones en el Número de Copia de ADN , Genoma Humano , Oligonucleótidos/genética , Algoritmos , Animales , Automatización , Análisis por Conglomerados , Drosophila , Femenino , Dosificación de Gen , Genoma , Humanos , Internet , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/genética , Masculino , Ratones , Pan troglodytes , Filogenia , Reacción en Cadena de la Polimerasa , Ratas , Reproducibilidad de los Resultados , Cromosomas Sexuales , Carcinoma Pulmonar de Células Pequeñas/diagnóstico , Carcinoma Pulmonar de Células Pequeñas/genética , Programas Informáticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...