Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
1.
Genome Res ; 33(7): 1015-1022, 2023 07.
Artículo en Inglés | MEDLINE | ID: mdl-37349109

RESUMEN

Although rates of recombination events across the genome (genetic maps) are fundamental to genetic research, the majority of current studies only use one standard map. There is evidence suggesting population differences in genetic maps, and thus estimating population-specific maps, are of interest. Although the recent availability of biobank-scale data offers such opportunities, current methods are not efficient at leveraging very large sample sizes. The most accurate methods are still linkage disequilibrium (LD)-based methods that are only tractable for a few hundred samples. In this work, we propose a fast and memory-efficient method for estimating genetic maps from population genotyping data. Our method, FastRecomb, leverages the efficient positional Burrows-Wheeler transform (PBWT) data structure for counting IBD segment boundaries as potential recombination events. We used PBWT blocks to avoid redundant counting of pairwise matches. Moreover, we used a panel-smoothing technique to reduce the noise from errors and recent mutations. Using simulation, we found that FastRecomb achieves state-of-the-art performance at 10-kb resolution, in terms of correlation coefficients between the estimated map and the ground truth. This is mainly because FastRecomb can effectively take advantage of large panels comprising more than hundreds of thousands of haplotypes. At the same time, other methods lack the efficiency to handle such data. We believe further refinement of FastRecomb would deliver more accurate genetic maps for the genetics community.


Asunto(s)
Bancos de Muestras Biológicas , Genoma , Haplotipos , Desequilibrio de Ligamiento , Polimorfismo de Nucleótido Simple , Recombinación Genética
2.
PLoS Genet ; 19(12): e1011057, 2023 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-38039339

RESUMEN

Although genome-wide association studies (GWAS) have identified tens of thousands of genetic loci, the genetic architecture is still not fully understood for many complex traits. Most GWAS and sequencing association studies have focused on single nucleotide polymorphisms or copy number variations, including common and rare genetic variants. However, phased haplotype information is often ignored in GWAS or variant set tests for rare variants. Here we leverage the identity-by-descent (IBD) segments inferred from a random projection-based IBD detection algorithm in the mapping of genetic associations with complex traits, to develop a computationally efficient statistical test for IBD mapping in biobank-scale cohorts. We used sparse linear algebra and random matrix algorithms to speed up the computation, and a genome-wide IBD mapping scan of more than 400,000 samples finished within a few hours. Simulation studies showed that our new method had well-controlled type I error rates under the null hypothesis of no genetic association in large biobank-scale cohorts, and outperformed traditional GWAS single-variant tests when the causal variants were untyped and rare, or in the presence of haplotype effects. We also applied our method to IBD mapping of six anthropometric traits using the UK Biobank data and identified a total of 3,442 associations, 2,131 (62%) of which remained significant after conditioning on suggestive tag variants in the ± 3 centimorgan flanking regions from GWAS.


Asunto(s)
Bancos de Muestras Biológicas , Estudio de Asociación del Genoma Completo , Humanos , Estudio de Asociación del Genoma Completo/métodos , Variaciones en el Número de Copia de ADN , Haplotipos/genética , Fenotipo , Polimorfismo de Nucleótido Simple/genética
3.
Bioinformatics ; 39(1)2023 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-36440908

RESUMEN

MOTIVATION: The positional Burrows-Wheeler transform (PBWT) has led to tremendous strides in haplotype matching on biobank-scale data. For genetic genealogical search, PBWT-based methods have optimized the asymptotic runtime of finding long matches between a query haplotype and a predefined panel of haplotypes. However, to enable fast query searches, the full-sized panel and PBWT data structures must be kept in memory, preventing existing algorithms from scaling up to modern biobank panels consisting of millions of haplotypes. In this work, we propose a space-efficient variation of PBWT named Syllable-PBWT, which divides every haplotype into syllables, builds the PBWT positional prefix arrays on the compressed syllabic panel, and leverages the polynomial rolling hash function for positional substring comparison. With the Syllable-PBWT data structures, we then present a long match query algorithm named Syllable-Query. RESULTS: Compared to the most time- and space-efficient previously published solution to the long match query problem, Syllable-Query reduced the memory use by a factor of over 100 on both the UK Biobank genotype data and the 1000 Genomes Project sequence data. Surprisingly, the smaller size of our syllabic data structures allows for more efficient iteration and CPU cache usage, granting Syllable-Query even faster runtime than existing solutions. AVAILABILITY AND IMPLEMENTATION: https://github.com/ZhiGroup/Syllable-PBWT. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Genoma , Haplotipos , Genotipo , Programas Informáticos , Análisis de Secuencia de ADN/métodos
4.
Bioinformatics ; 39(6)2023 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-37166451

RESUMEN

MOTIVATION: Due to the rapid growth of the genetic database size, genealogical search, a process of inferring familial relatedness by identifying DNA matches, has become a viable approach to help individuals finding missing family members or law enforcement agencies locating suspects. A fast and accurate method is needed to search an out-of-database individual against millions of individuals. Most existing approaches only offer all-versus-all within panel match. Some prototype algorithms offer one-versus-all query from out-of-panel individual, but they do not tolerate errors. RESULTS: A new method, random projection-based identity-by-descent (IBD) detection (RaPID) query, is introduced to make fast genealogical search possible. RaPID-Query identifies IBD segments between a query haplotype and a panel of haplotypes. By integrating matches over multiple PBWT indexes, RaPID-Query manages to locate IBD segments quickly with a given cutoff length while allowing mismatched sites. A single query against all UK biobank autosomal chromosomes was completed within 2.76 seconds on average, with the minimum length 7 cM and 700 markers. RaPID-Query achieved a 0.016 false negative rate and a 0.012 false positive rate simultaneously on a chromosome 20 sequencing panel having 86 265 sites. This is comparable to the state-of-the-art IBD detection method TPBWT(out-of-sample) and Hap-IBD. The high-quality IBD segments yielded by RaPID-Query were able to distinguish up to fourth degree of the familial relatedness for a given individual pair, and the area under the receiver operating characteristic curve values are at least 97.28%. AVAILABILITY AND IMPLEMENTATION: The RaPID-Query program is available at https://github.com/ucfcbb/RaPID-Query.


Asunto(s)
Algoritmos , Cromosomas , Humanos , Haplotipos , Análisis de Secuencia
5.
PLoS Genet ; 17(1): e1009315, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33476339

RESUMEN

Inference of relationships from whole-genome genetic data of a cohort is a crucial prerequisite for genome-wide association studies. Typically, relationships are inferred by computing the kinship coefficients (ϕ) and the genome-wide probability of zero IBD sharing (π0) among all pairs of individuals. Current leading methods are based on pairwise comparisons, which may not scale up to very large cohorts (e.g., sample size >1 million). Here, we propose an efficient relationship inference method, RAFFI. RAFFI leverages the efficient RaPID method to call IBD segments first, then estimate the ϕ and π0 from detected IBD segments. This inference is achieved by a data-driven approach that adjusts the estimation based on phasing quality and genotyping quality. Using simulations, we showed that RAFFI is robust against phasing/genotyping errors, admix events, and varying marker densities, and achieves higher accuracy compared to KING, the current leading method, especially for more distant relatives. When applied to the phased UK Biobank data with ~500K individuals, RAFFI is approximately 18 times faster than KING. We expect RAFFI will offer fast and accurate relatedness inference for even larger cohorts.


Asunto(s)
Estudio de Asociación del Genoma Completo/estadística & datos numéricos , Técnicas de Genotipaje/estadística & datos numéricos , Modelos Genéticos , Bancos de Muestras Biológicas , Genoma Humano/genética , Haplotipos/genética , Humanos , Linaje , Polimorfismo de Nucleótido Simple/genética
6.
BMC Biol ; 19(1): 32, 2021 02 16.
Artículo en Inglés | MEDLINE | ID: mdl-33593342

RESUMEN

BACKGROUND: The genealogical histories of individuals within populations are of interest to studies aiming both to uncover detailed pedigree information and overall quantitative population demographic histories. However, the analysis of quantitative details of individual genealogical histories has faced challenges from incomplete available pedigree records and an absence of objective and quantitative details in pedigree information. Although complete pedigree information for most individuals is difficult to track beyond a few generations, it is possible to describe a person's genealogical history using their genetic relatives revealed by identity by descent (IBD) segments-long genomic segments shared by two individuals within a population, which are identical due to inheritance from common ancestors. When modern biobanks collect genotype information for a significant fraction of a population, dense genetic connections of a person can be traced using such IBD segments, offering opportunities to characterize individuals in the context of the underlying populations. Here, we conducted an individual-centric analysis of IBD segments among the UK Biobank participants that represent 0.7% of the UK population. RESULTS: We made a high-quality call set of IBD segments over 5 cM among all 500,000 UK Biobank participants. On average, one UK individual shares IBD segments with 14,000 UK Biobank participants, which we refer to as "relatives." Using these segments, approximately 80% of a person's genome can be imputed. We subsequently propose genealogical descriptors based on the genetic connections of relative cohorts of individuals sharing at least one IBD segment and show that such descriptors offer important information about one's genetic makeup, personal genealogical history, and social behavior. Through analysis of relative counts sharing segments at different lengths, we identified a group, potentially British Jews, who has a distinct pattern of familial expansion history. Finally, using the enrichment of relatives in one's neighborhood, we identified regional variations of personal preference favoring living closer to one's extended families. CONCLUSIONS: Our analysis revealed genetic makeup, personal genealogical history, and social behaviors at the population scale, opening possibilities for further studies of individual's genetic connections in biobank data.


Asunto(s)
Bancos de Muestras Biológicas/estadística & datos numéricos , Genealogía y Heráldica , Variación Genética , Linaje , Humanos , Reino Unido
7.
Nat Methods ; 15(11): 928-931, 2018 11.
Artículo en Inglés | MEDLINE | ID: mdl-30377374

RESUMEN

Clustered regularly interspaced short palindromic repeats (CRISPR) guide RNA scaffolds have been adapted to carry multiple binding sites for fluorescent proteins to enhance brightness for live cell imaging of genomic loci. However, many of these modifications result in guide RNA instability and thus produce lower genome-labeling efficiency than anticipated. Here we introduce CRISPR-Sirius, based on octet arrays of aptamers conferring both enhanced guide RNA stability and brightness, and provide initial biological applications of this platform.


Asunto(s)
Sistemas CRISPR-Cas , Colorantes Fluorescentes/química , Genoma Humano , Genómica/métodos , Imagen Molecular/métodos , ARN Guía de Kinetoplastida/genética , Células A549 , Sitios de Unión , Genes Reporteros , Células HEK293 , Humanos , Microscopía Fluorescente
8.
Bioinformatics ; 35(14): i233-i241, 2019 07 15.
Artículo en Inglés | MEDLINE | ID: mdl-31510689

RESUMEN

MOTIVATION: With the wide availability of whole-genome genotype data, there is an increasing need for conducting genetic genealogical searches efficiently. Computationally, this task amounts to identifying shared DNA segments between a query individual and a very large panel containing millions of haplotypes. The celebrated Positional Burrows-Wheeler Transform (PBWT) data structure is a pre-computed index of the panel that enables constant time matching at each position between one haplotype and an arbitrarily large panel. However, the existing algorithm (Durbin's Algorithm 5) can only identify set-maximal matches, the longest matches ending at any location in a panel, while in real genealogical search scenarios, multiple 'good enough' matches are desired. RESULTS: In this work, we developed two algorithmic extensions of Durbin's Algorithm 5, that can find all L-long matches, matches longer than or equal to a given length L, between a query and a panel. In the first algorithm, PBWT-Query, we introduce 'virtual insertion' of the query into the PBWT matrix of the panel, and then scanning up and down for the PBWT match blocks with length greater than L. In our second algorithm, L-PBWT-Query, we further speed up PBWT-Query by introducing additional data structures that allow us to avoid iterating through blocks of incomplete matches. The efficiency of PBWT-Query and L-PBWT-Query is demonstrated using the simulated data and the UK Biobank data. Our results show that our proposed algorithms can detect related individuals for a given query efficiently in very large cohorts which enables a fast on-line query search. AVAILABILITY AND IMPLEMENTATION: genome.ucf.edu/pbwt-query. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Haplotipos , Algoritmos , Genoma , Genotipo , Programas Informáticos
9.
BMC Bioinformatics ; 20(Suppl 11): 279, 2019 Jun 06.
Artículo en Inglés | MEDLINE | ID: mdl-31167638

RESUMEN

BACKGROUND: Recent advances in whole-genome sequencing and SNP array technology have led to the generation of a large amount of genotype data. Large volumes of genotype data will require faster and more efficient methods for storing and searching the data. Positional Burrows-Wheeler Transform (PBWT) provides an appropriate data structure for bi-allelic data. With the increasing sample sizes, more multi-allelic sites are expected to be observed. Hence, there is a necessity to handle multi-allelic genotype data. RESULTS: In this paper, we introduce a multi-allelic version of the Positional Burrows-Wheeler Transform (mPBWT) based on the bi-allelic version for compression and searching. The time-complexity for constructing the data structure and searching within a panel containing t-allelic sites increases by a factor of t. CONCLUSION: Considering the small value for the possible alleles t, the time increase for the multi-allelic PBWT will be negligible and comparable to the bi-allelic version of PBWT.


Asunto(s)
Algoritmos , Alelos , Compresión de Datos , Genes , Haplotipos/genética , Humanos , Factores de Tiempo
10.
Proc Natl Acad Sci U S A ; 112(10): 3002-7, 2015 Mar 10.
Artículo en Inglés | MEDLINE | ID: mdl-25713381

RESUMEN

The intranuclear location of genomic loci and the dynamics of these loci are important parameters for understanding the spatial and temporal regulation of gene expression. Recently it has proven possible to visualize endogenous genomic loci in live cells by the use of transcription activator-like effectors (TALEs), as well as modified versions of the bacterial immunity clustered regularly interspersed short palindromic repeat (CRISPR)/CRISPR-associated protein 9 (Cas9) system. Here we report the design of multicolor versions of CRISPR using catalytically inactive Cas9 endonuclease (dCas9) from three bacterial orthologs. Each pair of dCas9-fluorescent proteins and cognate single-guide RNAs (sgRNAs) efficiently labeled several target loci in live human cells. Using pairs of differently colored dCas9-sgRNAs, it was possible to determine the intranuclear distance between loci on different chromosomes. In addition, the fluorescence spatial resolution between two loci on the same chromosome could be determined and related to the linear distance between them on the chromosome's physical map, thereby permitting assessment of the DNA compaction of such regions in a live cell.


Asunto(s)
Mapeo Cromosómico , Cromosomas Humanos , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Línea Celular Tumoral , Humanos , Microscopía Fluorescente
11.
Elife ; 132024 Jun 21.
Artículo en Inglés | MEDLINE | ID: mdl-38905121

RESUMEN

Runs-of-homozygosity (ROH) segments, contiguous homozygous regions in a genome were traditionally linked to families and inbred populations. However, a growing literature suggests that ROHs are ubiquitous in outbred populations. Still, most existing genetic studies of ROH in populations are limited to aggregated ROH content across the genome, which does not offer the resolution for mapping causal loci. This limitation is mainly due to a lack of methods for the efficient identification of shared ROH diplotypes. Here, we present a new method, ROH-DICE (runs-of-homozygous diplotype cluster enumerator), to find large ROH diplotype clusters, sufficiently long ROHs shared by a sufficient number of individuals, in large cohorts. ROH-DICE identified over 1 million ROH diplotypes that span over 100 single nucleotide polymorphisms (SNPs) and are shared by more than 100 UK Biobank participants. Moreover, we found significant associations of clustered ROH diplotypes across the genome with various self-reported diseases, with the strongest associations found between the extended human leukocyte antigen (HLA) region and autoimmune disorders. We found an association between a diplotype covering the homeostatic iron regulator (HFE) gene and hemochromatosis, even though the well-known causal SNP was not directly genotyped or imputed. Using a genome-wide scan, we identified a putative association between carriers of an ROH diplotype in chromosome 4 and an increase in mortality among COVID-19 patients (p-value = 1.82 × 10-11). In summary, our ROH-DICE method, by calling out large ROH diplotypes in a large outbred population, enables further population genetics into the demographic history of large populations. More importantly, our method enables a new genome-wide mapping approach for finding disease-causing loci with multi-marker recessive effects at a population scale.


Asunto(s)
Bancos de Muestras Biológicas , COVID-19 , Homocigoto , Polimorfismo de Nucleótido Simple , Humanos , Reino Unido , Polimorfismo de Nucleótido Simple/genética , COVID-19/genética , SARS-CoV-2/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Genoma Humano , Biobanco del Reino Unido
12.
bioRxiv ; 2023 Jan 10.
Artículo en Inglés | MEDLINE | ID: mdl-36712114

RESUMEN

While rates of recombination events across the genome (genetic maps) are fundamental to genetic research, the majority of current studies only use one standard map. There is evidence suggesting population differences in genetic maps, and thus estimating population-specific maps are of interest. While the recent availability of biobank-scale data offers such opportunities, current methods are not efficient at leveraging very large sample sizes. The most accurate methods are still linkage-disequilibrium (LD)-based methods that are only tractable for a few hundred samples. In this work, we propose a fast and memory-efficient method for estimating genetic maps from population genotyping data. Our method, FastRecomb, leverages the efficient positional Burrows-Wheeler transform (PBWT) data structure for counting IBD segment boundaries as potential recombination events. We used PBWT blocks to avoid redundant counting of pairwise matches. Moreover, we used a panel smoothing technique to reduce the noise from errors and recent mutations. Using simulation, we found that FastRecomb achieves state-of-the-art performance at 10k resolution, in terms of correlation coefficients between the estimated map and the ground truth. This is mainly due to the fact that FastRecomb can effectively take advantage of large panels comprising more than hundreds of thousands of haplotypes. At the same time, other methods lack the efficiency to handle such data. We believe further refinement of FastRecomb would deliver more accurate genetic maps for the genetics community.

13.
Gigascience ; 112022 12 06.
Artículo en Inglés | MEDLINE | ID: mdl-36472573

RESUMEN

In the recent biobank era of genetics, the problem of identical-by-descent (IBD) segment detection received renewed interest, as IBD segments in large cohorts offer unprecedented opportunities in the study of population and genealogical history, as well as genetic association of long haplotypes. While a new generation of efficient methods for IBD segment detection becomes available, direct comparison of these methods is difficult: existing benchmarks were often evaluated in different datasets, with some not openly accessible; methods benchmarked were run under suboptimal parameters; and benchmark performance metrics were not defined consistently. Here, we developed a comprehensive and completely open-source evaluation of the power, accuracy, and resource consumption of these IBD segment detection methods using realistic population genetic simulations with various settings. Our results pave the road for fair evaluation of IBD segment detection methods and provide an practical guide for users.


Asunto(s)
Bancos de Muestras Biológicas , Humanos
14.
Bioinform Adv ; 2(1): vbac045, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35785021

RESUMEN

Motivation: As large haplotype panels become increasingly available, efficient string matching algorithms such as positional Burrows-Wheeler transformation (PBWT) are promising for identifying shared haplotypes. However, recent mutations and genotyping errors create occasional mismatches, presenting challenges for exact haplotype matching. Previous solutions are based on probabilistic models or seed-and-extension algorithms that passively tolerate mismatches. Results: Here, we propose a PBWT-based smoothing algorithm, P-smoother, to actively 'correct' these mismatches and thus 'smooth' the panel. P-smoother runs a bidirectional PBWT-based panel scanning that flips mismatching alleles based on the overall haplotype matching context, which we call the IBD (identical-by-descent) prior. In a simulated panel with 4000 haplotypes and a 0.2% error rate, we show it can reliably correct 85% of errors. As a result, PBWT algorithms running over the smoothed panel can identify more pairwise IBD segments than that over the unsmoothed panel. Most strikingly, a PBWT-cluster algorithm running over the smoothed panel, which we call PS-cluster, achieves state-of-the-art performance for identifying multiway IBD segments, a challenging problem in the computational community for years. We also showed that PS-cluster is adequately efficient for UK Biobank data. Therefore, P-smoother opens up new possibilities for efficient error-tolerating algorithms for biobank-scale haplotype panels. Availability and implementation: Source code is available at github.com/ZhiGroup/P-smoother.

15.
medRxiv ; 2020 Oct 27.
Artículo en Inglés | MEDLINE | ID: mdl-33140058

RESUMEN

Runs of homozygosity (ROH) segments, contiguous homozygous regions in a genome were traditionally linked to families and inbred populations. However, a growing literature suggests that ROHs are ubiquitous in outbred populations. Still, most existing genetic studies of ROH in populations are limited to aggregated ROH content across the genome, which does not offer the resolution for mapping causal loci. This limitation is mainly due to a lack of methods for efficient identification of shared ROH diplotypes. Here, we present a new method, ROH-DICE, to find large ROH diplotype clusters, sufficiently long ROHs shared by a sufficient number of individuals, in large cohorts. ROH-DICE identified over 1 million ROH diplotypes that span over 100 SNPs and shared by more than 100 UK Biobank participants. Moreover, we found significant associations of clustered ROH diplotypes across the genome with various self-reported diseases, with the strongest associations found between the extended HLA region and autoimmune disorders. We found an association between a diplotype covering the HFE gene and haemochromatosis, even though the well-known causal SNP was not directly genotyped nor imputed. Using genome-wide scan, we identified a putative association between carriers of an ROH diplotype in chromosome 4 and an increase of mortality among COVID-19 patients. In summary, our ROH-DICE method, by calling out large ROH diplotypes in a large outbred population, enables further population genetics into the demographic history of large populations. More importantly, our method enables a new genome-wide mapping approach for finding disease-causing loci with multi-marker recessive effects at population scale.

16.
Genome Biol ; 21(1): 296, 2020 12 08.
Artículo en Inglés | MEDLINE | ID: mdl-33292531

RESUMEN

INTRODUCTION: Despite the long-observed correlation between H3K9me3, chromatin architecture, and transcriptional repression, how H3K9me3 regulates genome higher-order organization and transcriptional activity in living cells remains unclear. RESULT: Here, we develop EpiGo (Epigenetic perturbation induced Genome organization)-KRAB to introduce H3K9me3 at hundreds of loci spanning megabases on human chromosome 19 and simultaneously track genome organization. EpiGo-KRAB is sufficient to induce genomic clustering and de novo heterochromatin-like domain formation, which requires SETDB1, a methyltransferase of H3K9me3. Unexpectedly, EpiGo-KRAB-induced heterochromatin-like domain does not result in widespread gene repression except a small set of genes with concurrent loss of H3K4me3 and H3K27ac. Ectopic H3K9me3 appears to spread in inactive regions but is largely restricted from transcriptional initiation sites in active regions. Finally, Hi-C analysis showed that EpiGo-KRAB reshapes existing compartments mainly at compartment boundaries. CONCLUSIONS: These results reveal the role of H3K9me3 in genome organization could be partially separated from its function in gene repression.


Asunto(s)
Cromatina/metabolismo , Epigénesis Genética , Genoma , Histonas/metabolismo , Línea Celular , Femenino , Técnicas de Silenciamiento del Gen , Silenciador del Gen , Células HEK293 , Heterocromatina , N-Metiltransferasa de Histona-Lisina/genética , Humanos , Transcripción Genética
17.
Genome Biol ; 20(1): 143, 2019 07 25.
Artículo en Inglés | MEDLINE | ID: mdl-31345249

RESUMEN

While genetic relatedness, usually manifested as segments identical by descent (IBD), is ubiquitous in modern large biobanks, current IBD detection methods are not efficient at such a scale. Here, we describe an efficient method, RaPID, for detecting IBD segments in a panel with phased haplotypes. RaPID achieves a time and space complexity linear to the input size and the number of reported IBDs. With simulation, we showed that RaPID is orders of magnitude faster than existing method while offering competitive power and accuracy. In UK Biobank, RaPID identified 3,335,807 IBDs with a lenght ≥ 10 cM among 223,507 male X chromosomes in 11 min.


Asunto(s)
Técnicas de Genotipaje/métodos , Linaje , Bancos de Muestras Biológicas , Cromosomas Humanos X , Estudios de Cohortes , Simulación por Computador , Haplotipos , Humanos , Masculino
18.
J Cell Biol ; 218(5): 1467-1477, 2019 05 06.
Artículo en Inglés | MEDLINE | ID: mdl-30846483

RESUMEN

In contrast to the well-studied condensation and folding of chromosomes during mitosis, their dynamics during interphase are less understood. We deployed a CRISPR-based DNA imaging system to track the dynamics of genomic loci situated kilobases to megabases apart on a single chromosome. Two distinct modes of dynamics were resolved: local movements as well as ones that might reflect translational movements of the entire domain within the nucleoplasmic space. The magnitude of both of these modes of movements increased from early to late G1, whereas the translational movements were reduced in early S phase. The local fluctuations decreased slightly in early S and more markedly in mid-late S. These newly observed movements and their cell cycle dependence suggest the existence of a hitherto unrecognized compaction-relaxation dynamic of the interphase chromosome fiber, operating concurrently with changes in the extent of overall movements of loci in the 4D genome.


Asunto(s)
Ciclo Celular/genética , Núcleo Celular/genética , Cromosomas Humanos/genética , Genoma Humano , Genómica/métodos , Mitosis/genética , Osteosarcoma/genética , Neoplasias Óseas/genética , Neoplasias Óseas/patología , Humanos , Interfase , Osteosarcoma/patología , Células Tumorales Cultivadas
19.
Nat Biotechnol ; 34(5): 528-30, 2016 05.
Artículo en Inglés | MEDLINE | ID: mdl-27088723

RESUMEN

A lack of techniques to image multiple genomic loci in living cells has limited our ability to investigate chromosome dynamics. Here we describe CRISPRainbow, a system for labeling DNA in living cells based on nuclease-dead (d) Cas9 combined with engineered single guide RNA (sgRNA) scaffolds that bind sets of fluorescent proteins. We demonstrate simultaneous imaging of up to six chromosomal loci in individual live cells and document large differences in the dynamic properties of different chromosomal loci.


Asunto(s)
Proteínas Bacterianas/genética , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas/genética , Endonucleasas/genética , Edición Génica/métodos , Microscopía Fluorescente/métodos , ARN/genética , Proteína 9 Asociada a CRISPR , Sitios Genéticos/genética , Coloración y Etiquetado
20.
J Cell Biol ; 214(5): 529-37, 2016 08 29.
Artículo en Inglés | MEDLINE | ID: mdl-27551060

RESUMEN

The bacterial CRISPR-Cas9 system has been repurposed for genome engineering, transcription modulation, and chromosome imaging in eukaryotic cells. However, the nuclear dynamics of clustered regularly interspaced short palindromic repeats (CRISPR)-associated protein 9 (Cas9) guide RNAs and target interrogation are not well defined in living cells. Here, we deployed a dual-color CRISPR system to directly measure the stability of both Cas9 and guide RNA. We found that Cas9 is essential for guide RNA stability and that the nuclear Cas9-guide RNA complex levels limit the targeting efficiency. Fluorescence recovery after photobleaching measurements revealed that single mismatches in the guide RNA seed sequence reduce the target residence time from >3 h to as low as <2 min in a nucleotide identity- and position-dependent manner. We further show that the duration of target residence correlates with cleavage activity. These results reveal that CRISPR discriminates between genuine versus mismatched targets for genome editing via radical alterations in residence time.


Asunto(s)
Sistemas CRISPR-Cas/genética , Núcleo Celular/metabolismo , Secuencia de Bases , Línea Celular Tumoral , Supervivencia Celular , ADN/metabolismo , Humanos , Modelos Biológicos , Estabilidad del ARN , ARN Guía de Kinetoplastida , Factores de Tiempo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA