Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 31
Filtrar
1.
Cell ; 154(3): 530-40, 2013 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-23911320

RESUMO

To mechanistically characterize the microevolutionary processes active in altering transcription factor (TF) binding among closely related mammals, we compared the genome-wide binding of three tissue-specific TFs that control liver gene expression in six rodents. Despite an overall fast turnover of TF binding locations between species, we identified thousands of TF regions of highly constrained TF binding intensity. Although individual mutations in bound sequence motifs can influence TF binding, most binding differences occur in the absence of nearby sequence variations. Instead, combinatorial binding was found to be significant for genetic and evolutionary stability; cobound TFs tend to disappear in concert and were sensitive to genetic knockout of partner TFs. The large, qualitative differences in genomic regions bound between closely related mammals, when contrasted with the smaller, quantitative TF binding differences among Drosophila species, illustrate how genome structure and population genetics together shape regulatory evolution.


Assuntos
Evolução Molecular , Camundongos/classificação , Camundongos/genética , Fatores de Transcrição/genética , Animais , Drosophila/genética , Fígado/metabolismo , Camundongos/metabolismo , Camundongos Endogâmicos , Camundongos Knockout , Ratos/genética , Fatores de Transcrição/metabolismo
2.
Mol Ecol ; : e17257, 2023 Dec 27.
Artigo em Inglês | MEDLINE | ID: mdl-38149334

RESUMO

The question of how local adaptation takes place remains a fundamental question in evolutionary biology. The variation of allele frequencies in genes under selection over environmental gradients remains mainly theoretical and its empirical assessment would help understanding how adaptation happens over environmental clines. To bring new insights to this issue we set up a broad framework which aimed to compare the adaptive trajectories over environmental clines in two domesticated mammal species co-distributed in diversified landscapes. We sequenced the genomes of 160 sheep and 161 goats extensively managed along environmental gradients, including temperature, rainfall, seasonality and altitude, to identify genes and biological processes shaping local adaptation. Allele frequencies at putatively adaptive loci were rarely found to vary gradually along environmental gradients, but rather displayed a discontinuous shift at the extremities of environmental clines. Of the 430 candidate adaptive genes identified, only 6 were orthologous between sheep and goats and those responded differently to environmental pressures, suggesting different putative mechanisms involved in local adaptation in these two closely related species. Interestingly, the genomes of the 2 species were impacted differently by the environment, genes related to signatures of selection were most related to altitude, slope and rainfall seasonality for sheep, and summer temperature and spring rainfall for goats. The diversity of candidate adaptive pathways may result from a high number of biological functions involved in the adaptations to multiple eco-climatic gradients, and a differential role of climatic drivers on the two species, despite their co-distribution along the same environmental gradients. This study describes empirical examples of clinal variation in putatively adaptive alleles with different patterns in allele frequency distributions over continuous environmental gradients, thus showing the diversity of genetic responses in adaptive landscapes and opening new horizons for understanding genomics of adaptation in mammalian species and beyond.

3.
Nature ; 546(7658): 370-375, 2017 06 15.
Artigo em Inglês | MEDLINE | ID: mdl-28489815

RESUMO

Technology utilizing human induced pluripotent stem cells (iPS cells) has enormous potential to provide improved cellular models of human disease. However, variable genetic and phenotypic characterization of many existing iPS cell lines limits their potential use for research and therapy. Here we describe the systematic generation, genotyping and phenotyping of 711 iPS cell lines derived from 301 healthy individuals by the Human Induced Pluripotent Stem Cells Initiative. Our study outlines the major sources of genetic and phenotypic variation in iPS cells and establishes their suitability as models of complex human traits and cancer. Through genome-wide profiling we find that 5-46% of the variation in different iPS cell phenotypes, including differentiation capacity and cellular morphology, arises from differences between individuals. Additionally, we assess the phenotypic consequences of genomic copy-number alterations that are repeatedly observed in iPS cells. In addition, we present a comprehensive map of common regulatory variants affecting the transcriptome of human pluripotent cells.


Assuntos
Variação Genética/genética , Células-Tronco Pluripotentes Induzidas/metabolismo , Células Cultivadas , Reprogramação Celular/genética , Variações do Número de Cópias de DNA/genética , Regulação da Expressão Gênica/genética , Genótipo , Humanos , Especificidade de Órgãos , Fenótipo , Controle de Qualidade , Locos de Características Quantitativas/genética , Transcriptoma/genética
5.
Genome Res ; 28(11): 1720-1732, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30341161

RESUMO

Despite the rapid development of sequencing technologies, the assembly of mammalian-scale genomes into complete chromosomes remains one of the most challenging problems in bioinformatics. To help address this difficulty, we developed Ragout 2, a reference-assisted assembly tool that works for large and complex genomes. By taking one or more target assemblies (generated from an NGS assembler) and one or multiple related reference genomes, Ragout 2 infers the evolutionary relationships between the genomes and builds the final assemblies using a genome rearrangement approach. By using Ragout 2, we transformed NGS assemblies of 16 laboratory mouse strains into sets of complete chromosomes, leaving <5% of sequence unlocalized per set. Various benchmarks, including PCR testing and realigning of long Pacific Biosciences (PacBio) reads, suggest only a small number of structural errors in the final assemblies, comparable with direct assembly approaches. We applied Ragout 2 to the Mus caroli and Mus pahari genomes, which exhibit karyotype-scale variations compared with other genomes from the Muridae family. Chromosome painting maps confirmed most large-scale rearrangements that Ragout 2 detected. We applied Ragout 2 to improve draft sequences of three ape genomes that have recently been published. Ragout 2 transformed three sets of contigs (generated using PacBio reads only) into chromosome-scale assemblies with accuracy comparable to chromosome assemblies generated in the original study using BioNano maps, Hi-C, BAC clones, and FISH.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Sequenciamento Completo do Genoma/métodos , Animais , Mapeamento de Sequências Contíguas/normas , Camundongos , Padrões de Referência , Sequenciamento Completo do Genoma/normas
6.
Genome Res ; 28(4): 448-459, 2018 04.
Artigo em Inglês | MEDLINE | ID: mdl-29563166

RESUMO

Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality genome assemblies. Here, we have created chromosome-level assemblies of the Mus caroli and Mus pahari genomes. Together with the Mus musculus and Rattus norvegicus genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of Mus musculus and Mus caroli between 3 and 6 million yr ago, but that are absent in the Hominidae. Hominidae show between four- and sevenfold lower rates of nucleotide change and feature turnover in both neutral and functional sequences, suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. Recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in Mus caroli, which resulted in thousands of novel, species-specific CTCF binding sites. Our results show that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology.


Assuntos
Evolução Molecular , Genoma/genética , Muridae/genética , Filogenia , Animais , Sítios de Ligação , Fator de Ligação a CCCTC/genética , Cromossomos/genética , Cariotipagem/métodos , Elementos Nucleotídeos Longos e Dispersos/genética , Camundongos , Retroelementos/genética , Especificidade da Espécie
7.
Nucleic Acids Res ; 45(D1): D691-D697, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27733501

RESUMO

The Human Induced Pluripotent Stem Cell Initiative (HipSci) isf establishing a large catalogue of human iPSC lines, arguably the most well characterized collection to date. The HipSci portal enables researchers to choose the right cell line for their experiment, and makes HipSci's rich catalogue of assay data easy to discover and reuse. Each cell line has genomic, transcriptomic, proteomic and cellular phenotyping data. Data are deposited in the appropriate EMBL-EBI archives, including the European Nucleotide Archive (ENA), European Genome-phenome Archive (EGA), ArrayExpress and PRoteomics IDEntifications (PRIDE) databases. The project will make 500 cell lines from healthy individuals, and from 150 patients with rare genetic diseases; these will be available through the European Collection of Authenticated Cell Cultures (ECACC). As of August 2016, 238 cell lines are available for purchase. Project data is presented through the HipSci data portal (http://www.hipsci.org/lines) and is downloadable from the associated FTP site (ftp://ftp.hipsci.ebi.ac.uk/vol1/ftp). The data portal presents a summary matrix of the HipSci cell lines, showing available data types. Each line has its own page containing descriptive metadata, quality information, and links to archived assay data. Analysis results are also available in a Track Hub, allowing visualization in the context of public genomic annotations (http://www.hipsci.org/data/trackhubs).


Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Genômica/métodos , Células-Tronco Pluripotentes Induzidas , Linhagem Celular , Estudos de Associação Genética , Humanos , Células-Tronco Pluripotentes Induzidas/citologia , Células-Tronco Pluripotentes Induzidas/metabolismo , Proteoma , Software , Transcriptoma
8.
Nucleic Acids Res ; 45(D1): D854-D859, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27638885

RESUMO

The International Genome Sample Resource (IGSR; http://www.internationalgenome.org) expands in data type and population diversity the resources from the 1000 Genomes Project. IGSR represents the largest open collection of human variation data and provides easy access to these resources. IGSR was established in 2015 to maintain and extend the 1000 Genomes Project data, which has been widely used as a reference set of human variation and by researchers developing analysis methods. IGSR has mapped all of the 1000 Genomes sequence to the newest human reference (GRCh38), and will release updated variant calls to ensure maximal usefulness of the existing data. IGSR is collecting new structural variation data on the 1000 Genomes samples from long read sequencing and other technologies, and will collect relevant functional data into a single comprehensive resource. IGSR is extending coverage with new populations sequenced by collaborating groups. Here, we present the new data and analysis that IGSR has made available. We have also introduced a new data portal that increases discoverability of our data-previously only browseable through our FTP site-by focusing on particular samples, populations or data sets of interest.


Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Variação Genética , Genoma , Genômica/métodos , Navegador
9.
Methods ; 96: 85-96, 2016 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-26608109

RESUMO

Induced pluripotent stem cells (iPSCs) provide invaluable opportunities for future cell therapies as well as for studying human development, modelling diseases and discovering therapeutics. In order to realise the potential of iPSCs, it is crucial to comprehensively characterise cells generated from large cohorts of healthy and diseased individuals. The human iPSC initiative (HipSci) is assessing a large panel of cell lines to define cell phenotypes, dissect inter- and intra-line and donor variability and identify its key determinant components. Here we report the establishment of a high-content platform for phenotypic analysis of human iPSC lines. In the described assay, cells are dissociated and seeded as single cells onto 96-well plates coated with fibronectin at three different concentrations. This method allows assessment of cell number, proliferation, morphology and intercellular adhesion. Altogether, our strategy delivers robust quantification of phenotypic diversity within complex cell populations facilitating future identification of the genetic, biological and technical determinants of variance. Approaches such as the one described can be used to benchmark iPSCs from multiple donors and create novel platforms that can readily be tailored for disease modelling and drug discovery.


Assuntos
Fibronectinas/química , Ensaios de Triagem em Larga Escala , Células-Tronco Pluripotentes Induzidas/ultraestrutura , Imagem Molecular/métodos , Fenótipo , Sequência de Aminoácidos , Adesão Celular , Diferenciação Celular , Linhagem Celular , Células Alimentadoras/citologia , Humanos , Células-Tronco Pluripotentes Induzidas/metabolismo , Dados de Sequência Molecular , Peptídeos/química
10.
Phys Chem Chem Phys ; 17(34): 22377-88, 2015 Sep 14.
Artigo em Inglês | MEDLINE | ID: mdl-26247336

RESUMO

Classical molecular dynamics (MD) simulations have been employed to study the interaction of the saccharides glucuronic acid (GlcA) and N-acetylgalactosamine (GalNAc) with the (0001) and (011̄0) surfaces of the mineral hydroxyapatite (HAP). GlcA and GalNAc are the two constituent monosaccharides of the glycosaminoglycan chondroitin sulfate, which is commonly found in bone and cartilage and has been implicated in the modulation of the hydroxyapatite biomineralization process. MD simulations of the mineral surfaces and the saccharides in the presence of solvent water allowed the calculation of the adsorption energies of the saccharides on the HAP surfaces. The calculations show that GalNAc interacts with HAP principally through the sulfate and the carbonyl of acetyl amine groups, whereas the GlcA interacts primarily through the carboxylate functional groups. The mode and strength of the interaction depends on the orientation of the saccharide with respect to the surface and the level of disruption of the layer of water competing with the saccharide for adsorption sites on the HAP surface, suggesting that chondroitin 4-sulfate binds to the layer of solvent water rather than to HAP.


Assuntos
Acetilgalactosamina/química , Sulfatos de Condroitina/química , Durapatita/química , Ácido Glucurônico/química , Simulação de Dinâmica Molecular , Água/química , Adsorção , Configuração de Carboidratos , Monossacarídeos/química , Propriedades de Superfície
11.
J Biol Chem ; 285(21): 15674-81, 2010 May 21.
Artigo em Inglês | MEDLINE | ID: mdl-20308077

RESUMO

Little is known about the rate at which protein turnover occurs in living tendon and whether the rate differs between tendons with different physiological roles. In this study, we have quantified the racemization of aspartic acid to calculate the age of the collagenous and non-collagenous components of the high strain injury-prone superficial digital flexor tendon (SDFT) and low strain rarely injured common digital extensor tendon (CDET) in a group of horses with a wide age range. In addition, the turnover of collagen was assessed indirectly by measuring the levels of collagen degradation markers (collagenase-generated neoepitope and cross-linked telopeptide of type I collagen). The fractional increase in D-Asp was similar (p = 0.7) in the SDFT (5.87 x 10(-4)/year) and CDET (5.82 x 10(-4)/year) tissue, and D/L-Asp ratios showed a good correlation with pentosidine levels. We calculated a mean (+/-S.E.) collagen half-life of 197.53 (+/-18.23) years for the SDFT, which increased significantly with horse age (p = 0.03) and was significantly (p < 0.001) higher than that for the CDET (34.03 (+/-3.39) years). Using similar calculations, the half-life of non-collagenous protein was 2.18 (+/-0.41) years in the SDFT and was significantly (p = 0.04) lower than the value of 3.51 (+/-0.51) years for the CDET. Collagen degradation markers were higher in the CDET and suggested an accumulation of partially degraded collagen within the matrix with aging in the SDFT. We propose that increased susceptibility to injury in older individuals results from an inability to remove partially degraded collagen from the matrix leading to reduced mechanical competence.


Assuntos
Envelhecimento/metabolismo , Ácido Aspártico/metabolismo , Colágeno Tipo I/metabolismo , Traumatismos dos Tendões/metabolismo , Tendões/metabolismo , Envelhecimento/patologia , Animais , Biomarcadores/metabolismo , Feminino , Cavalos , Masculino , Traumatismos dos Tendões/patologia , Tendões/patologia
12.
Soft Matter ; 7(7): 3373-3382, 2011 Apr 07.
Artigo em Inglês | MEDLINE | ID: mdl-23526918

RESUMO

Molecular dynamics simulations of collagen are used to investigate at the atomistic level the nature of the interprotein interactions that are present within a collagen fibril, and which are responsible for the fibril's thermodynamic stability. Simulations both of a collagen fibril and of a fully solvated tropcollagen are compared in order to study the interactions that arise between the proteins upon the process of fibrillogenesis. The interactions studied include direct interprotein hydrogen bonds, water-mediated interprotein hydrogen bonds, and hydrophobic interactions. The simulations are used to quantify the number of interprotein interactions that form; to study which functional groups contribute most towards the interactions; and to study the spatial distribution of interprotein interactions throughout the fibril's D period. The processes of collagen fibrillogenesis and protein folding are then compared with each other, because these two physical processes share many similarities in concept, and the latter has been more widely studied. Molecular dynamics simulations of a bacteriophage T4 lysozyme protein, both in its native state and in and unfolded state, are used as an illustrative example of a typical protein folding process, for direct comparison with the collagen simulations.

13.
Analyst ; 136(19): 4013-9, 2011 Oct 07.
Artigo em Inglês | MEDLINE | ID: mdl-21804989

RESUMO

Understanding the basal O(2) and nutrient requirements of cells is paramount when culturing cells in 3D tissue models. Any scaffold design will need to take such parameters into consideration, especially as the addition of cells introduces gradients of consumption of such molecules from the surface to the core of scaffolds. We have cultured two cell types in 3D native collagen type I scaffolds, and measured the O(2) tension at specific locations within the scaffold. By changing the density of cells, we have established O(2) consumption gradients within these scaffolds and using mathematical modeling have derived rates of consumption for O(2). For human dermal fibroblasts the average rate constant was 1.19 × 10(-17) mol cell(-1) s(-1), and for human bone marrow derived stromal cells the average rate constant was 7.91 × 10(-18) mol cell(-1) s(-1). These values are lower than previously published rates for similar cells cultured in 2D, but the values established in this current study are more representative of rates of consumption measured in vivo. These values will dictate 3D culture parameters, including maximum cell-seeding density and maximum size of the constructs, for long-term viability of tissue models.


Assuntos
Técnicas de Cultura de Células , Fibroblastos/metabolismo , Consumo de Oxigênio , Oxigênio/metabolismo , Células Estromais/metabolismo , Células Cultivadas , Fibroblastos/química , Fibroblastos/citologia , Humanos , Cinética , Oxigênio/análise , Células Estromais/química , Células Estromais/citologia
14.
Mol Ecol Resour ; 19(6): 1497-1515, 2019 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-31359622

RESUMO

Whole genome sequences (WGS) greatly increase our ability to precisely infer population genetic parameters, demographic processes, and selection signatures. However, WGS may still be not affordable for a representative number of individuals/populations. In this context, our goal was to assess the efficiency of several SNP genotyping strategies by testing their ability to accurately estimate parameters describing neutral diversity and to detect signatures of selection. We analysed 110 WGS at 12× coverage for four different species, i.e., sheep, goats and their wild counterparts. From these data we generated 946 data sets corresponding to random panels of 1K to 5M variants, commercial SNP chips and exome capture, for sample sizes of five to 48 individuals. We also extracted low-coverage genome resequencing of 1×, 2× and 5× by randomly subsampling reads from the 12× resequencing data. Globally, 5K to 10K random variants were enough for an accurate estimation of genome diversity. Conversely, commercial panels and exome capture displayed strong ascertainment biases. Besides the characterization of neutral diversity, the detection of the signature of selection and the accurate estimation of linkage disequilibrium (LD) required high-density panels of at least 1M variants. Finally, genotype likelihoods increased the quality of variant calling from low coverage resequencing but proportions of incorrect genotypes remained substantial, especially for heterozygote sites. Whole genome resequencing coverage of at least 5× appeared to be necessary for accurate assessment of genomic variations. These results have implications for studies seeking to deploy low-density SNP collections or genome scans across genetically diverse populations/species showing similar genetic characteristics and patterns of LD decay for a wide variety of purposes.


Assuntos
Genoma/genética , Polimorfismo de Nucleotídeo Único/genética , Animais , Exoma/genética , Frequência do Gene/genética , Genética Populacional/métodos , Genômica/métodos , Genótipo , Técnicas de Genotipagem/métodos , Cabras/genética , Heterozigoto , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Desequilíbrio de Ligação/genética , Análise de Sequência de DNA/métodos , Ovinos/genética , Sequenciamento Completo do Genoma/métodos
15.
J Phys Chem B ; 112(13): 4059-66, 2008 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-18335926

RESUMO

The finite difference method is used to simulate chronoamperometry, linear sweep voltammetry, and cyclic voltammetry at conical electrodes and microelectrodes. Techniques for the numerical simulation of these processes at microdiscs are adapted and extended to accurately model diffusion to the electroactive cone surface. Simulated results are analyzed, and trends are rationalized in terms of the cone apex angle, alpha. The diffusion domain approximation is used to extend the theory to regular and random arrays of conical electrodes.

16.
J Phys Chem B ; 112(25): 7560-5, 2008 Jun 26.
Artigo em Inglês | MEDLINE | ID: mdl-18507454

RESUMO

The extraction of electrode kinetic parameters for electrochemical couples in room-temperature ionic liquids (RTILs) is currently an area of considerable interest. Electrochemists typically measure electrode kinetics in the limits of either transient planar or steady-state convergent diffusion for which the voltammetric response is well understood. In this paper we develop a general method allowing the extraction of this kinetic data in the region where the diffusion is intermediate between the planar and convergent limits, such as is often encountered in RTILs using microelectrode voltammetry. A general working surface is derived, allowing the inference of Butler-Volmer standard electrochemical rate constants for the peak-to-peak potential separation in a cyclic voltammogram as a function of voltage scan rate. The method is applied to the case of the ferrocene/ferrocenium couple in [C2mim][N(Tf)2] and [C4mim][N(Tf)2].

17.
Stem Cell Reports ; 10(1): 1-6, 2018 01 09.
Artigo em Inglês | MEDLINE | ID: mdl-29320760

RESUMO

Unambiguous cell line authentication is essential to avoid loss of association between data and cells. The risk for loss of references increases with the rapidity that new human pluripotent stem cell (hPSC) lines are generated, exchanged, and implemented. Ideally, a single name should be used as a generally applied reference for each cell line to access and unify cell-related information across publications, cell banks, cell registries, and databases and to ensure scientific reproducibility. We discuss the needs and requirements for such a unique identifier and implement a standard nomenclature for hPSCs, which can be automatically generated and registered by the human pluripotent stem cell registry (hPSCreg). To avoid ambiguities in PSC-line referencing, we strongly urge publishers to demand registration and use of the standard name when publishing research based on hPSC lines.


Assuntos
Bancos de Espécimes Biológicos , Bases de Dados Factuais , Células-Tronco Pluripotentes , Sistema de Registros , Terminologia como Assunto , Humanos
18.
Nat Commun ; 9(1): 813, 2018 03 06.
Artigo em Inglês | MEDLINE | ID: mdl-29511174

RESUMO

The evolutionary basis of domestication has been a longstanding question and its genetic architecture is becoming more tractable as more domestic species become genome-enabled. Before becoming established worldwide, sheep and goats were domesticated in the fertile crescent 10,500 years before present (YBP) where their wild relatives remain. Here we sequence the genomes of wild Asiatic mouflon and Bezoar ibex in the sheep and goat domestication center and compare their genomes with that of domestics from local, traditional, and improved breeds. Among the genomic regions carrying selective sweeps differentiating domestic breeds from wild populations, which are associated among others to genes involved in nervous system, immunity and productivity traits, 20 are common to Capra and Ovis. The patterns of selection vary between species, suggesting that while common targets of selection related to domestication and improvement exist, different solutions have arisen to achieve similar phenotypic end-points within these closely related livestock species.


Assuntos
Animais Domésticos/genética , Domesticação , Genoma , Cabras/genética , Carneiro Doméstico/genética , Animais , Evolução Biológica , Variação Genética/genética , Genômica , Haplótipos , Fenótipo , Filogenia , Seleção Genética , Sequenciamento Completo do Genoma
19.
Gigascience ; 6(7): 1-8, 2017 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-28531267

RESUMO

The 1000 Genomes Project produced more than 100 trillion basepairs of short read sequence from more than 2600 samples in 26 populations over a period of five years. In its final phase, the project released over 85 million genotyped and phased variants on human reference genome assembly GRCh37. An updated reference assembly, GRCh38, was released in late 2013, but there was insufficient time for the final phase of the project analysis to change to the new assembly. Although it is possible to lift the coordinates of the 1000 Genomes Project variants to the new assembly, this is a potentially error-prone process as coordinate remapping is most appropriate only for non-repetitive regions of the genome and those that did not see significant change between the two assemblies. It will also miss variants in any region that was newly added to GRCh38. Thus, to produce the highest quality variants and genotypes on GRCh38, the best strategy is to realign the reads and recall the variants based on the new alignment. As the first step of variant calling for the 1000 Genomes Project data, we have finished remapping all of the 1000 Genomes sequence reads to GRCh38 with alternative scaffold-aware BWA-MEM. The resulting alignments are available as CRAM, a reference-based sequence compression format. The data have been released on our FTP site and are also available from European Nucleotide Archive to facilitate researchers discovering variants on the primary sequences and alternative contigs of GRCh38.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Projeto Genoma Humano , Alinhamento de Sequência/métodos , Sequenciamento Completo do Genoma/métodos , Algoritmos , Mapeamento de Sequências Contíguas/normas , Humanos , Padrões de Referência , Alinhamento de Sequência/normas , Sequenciamento Completo do Genoma/normas
20.
Stem Cell Res ; 20: 105-114, 2017 04.
Artigo em Inglês | MEDLINE | ID: mdl-28334554

RESUMO

A fast track "Hot Start" process was implemented to launch the European Bank for Induced Pluripotent Stem Cells (EBiSC) to provide early release of a range of established control and disease linked human induced pluripotent stem cell (hiPSC) lines. Established practice amongst consortium members was surveyed to arrive at harmonised and publically accessible Standard Operations Procedures (SOPs) for tissue procurement, bio-sample tracking, iPSC expansion, cryopreservation, qualification and distribution to the research community. These were implemented to create a quality managed foundational collection of lines and associated data made available for distribution. Here we report on the successful outcome of this experience and work flow for banking and facilitating access to an otherwise disparate European resource, with lessons to benefit the international research community. ETOC: The report focuses on the EBiSC experience of rapidly establishing an operational capacity to procure, bank and distribute a foundational collection of established hiPSC lines. It validates the feasibility and defines the challenges of harnessing and integrating the capability and productivity of centres across Europe using commonly available resources currently in the field.


Assuntos
Bancos de Espécimes Biológicos , Células-Tronco Pluripotentes Induzidas/citologia , Linhagem Celular , Criopreservação , Europa (Continente) , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA