Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
1.
Bioinformatics ; 30(12): 1651-9, 2014 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-24532718

RESUMO

MOTIVATION: Advances in sequencing technologies and computational algorithms have enabled the study of genomic variants to dissect their functional consequence. Despite this unprecedented progress, current tools fail to reliably detect and characterize more complex allelic variants, such as short tandem repeats (STRs). We developed TSSV as an efficient and sensitive tool to specifically profile all allelic variants present in targeted loci. Based on its design, requiring only two short flanking sequences, TSSV can work without the use of a complete reference sequence to reliably profile highly polymorphic, repetitive or uncharacterized regions. RESULTS: We show that TSSV can accurately determine allelic STR structures in mixtures with 10% representation of minor alleles or complex mixtures in which a single STR allele is shared. Furthermore, we show the universal utility of TSSV in two other independent studies: characterizing de novo mutations introduced by transcription activator-like effector nucleases (TALENs) and profiling the noise and systematic errors in an IonTorrent sequencing experiment. TSSV complements the existing tools by aiding the study of highly polymorphic and complex regions and provides a high-resolution map that can be used in a wide range of applications, from personal genomics to forensic analysis and clinical diagnostics. AVAILABILITY AND IMPLEMENTATION: We have implemented TSSV as a Python package that can be installed through the command-line using pip install TSSV command. Its source code and documentation are available at https://pypi.python.org/pypi/tssv and http://www.lgtc.nl/tssv.


Assuntos
Alelos , Genômica/métodos , Repetições de Microssatélites , Software , Algoritmos , Desoxirribonucleases/metabolismo , Distrofina/genética , Feminino , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Mutação , Análise de Sequência de DNA
2.
Am J Hum Genet ; 86(3): 364-77, 2010 Mar 12.
Artigo em Inglês | MEDLINE | ID: mdl-20206332

RESUMO

Subtelomeres are dynamic structures composed of blocks of homologous DNA sequences. These so-called duplicons are dispersed over many chromosome ends. We studied the human 4q and 10q subtelomeres, which contain the polymorphic macrosatellite repeat D4Z4 and which share high sequence similarity over a region of, on average, >200 kb. Sequence analysis of four polymorphic markers in the African, European, and Asian HAPMAP panels revealed 17 subtelomeric 4q and eight subtelomeric 10qter haplotypes. Haplotypes that are composed of a mixture of 4q and 10q sequences were detected at frequencies >10% in all three populations, seemingly supporting a mechanism of ongoing interchromosomal exchanges between these chromosomes. We constructed an evolutionary network of most haplotypes and identified the 4q haplotype ancestral to all 4q and 10q haplotypes. According to the network, all subtelomeres originate from only four discrete sequence-transfer events during human evolution, and haplotypes with mixtures of 4q- and 10q-specific sequences represent intermediate structures in the transition from 4q to 10q subtelomeres. Haplotype distribution studies on a large number of globally dispersed human DNA samples from the HGDP-CEPH panel supported our findings and show that all haplotypes were present before human migration out of Africa. D4Z4 repeat array contractions on the 4A161 haplotype cause Facioscapulohumeral muscular dystrophy (FSHD), whereas contractions on most other haplotypes are nonpathogenic. We propose that the limited occurrence of interchromosomal sequence transfers results in an accumulation of haplotype-specific polymorphisms that can explain the unique association of FSHD with D4Z4 contractions in a single 4q subtelomere.


Assuntos
Cromossomos Humanos Par 10/genética , Cromossomos Humanos Par 4/genética , Evolução Molecular , Genética Populacional , Telômero/genética , Alelos , Sequência de Bases , DNA/genética , Primers do DNA/genética , Bases de Dados de Ácidos Nucleicos , Haplótipos , Humanos , Dados de Sequência Molecular , Polimorfismo Genético , Sequências Repetitivas de Ácido Nucleico , Homologia de Sequência do Ácido Nucleico
3.
Int J Legal Med ; 127(4): 741-7, 2013 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-23306520

RESUMO

DNA profiles from degraded samples often suffer from information loss at the longer short tandem repeat (STR) loci. Sensitising the reactions, either by performing additional PCR cycles or increasing the capillary electrophoresis injection settings, carries the risk of over-amplifying or overloading the shorter fragments. We explored whether profiling of degraded DNA can be improved by preferential capturing of the longer amplified fragments. To this aim, a post-PCR purification protocol was developed that is based on AMPure XP beads that have size-selective properties. A comparison was made with an unselective post-PCR purification system (DTR gel filtration) and no purification of the PCR products. Besides a set of differently and serially degraded single source samples, unequal mixtures of degraded DNAs were analysed, in order to extract more genotyping information for the minor contributor without overloading the major component at the shorter amplicons. Purification by the AMPure protocol resulted in higher peak heights especially for the longer amplicons, while DTR gel filtration gave higher peaks for all amplicon sizes. Both purification methods presented more detected alleles, with the AMPure protocol performing slightly better, on average. In conclusion, the in-house developed AMPure protocol can be employed to improve STR profile analysis of degraded single source and (unequally) mixed DNA samples.


Assuntos
Degradação Necrótica do DNA , Impressões Digitais de DNA/métodos , Repetições de Microssatélites , Alelos , Humanos , Reação em Cadeia da Polimerase , Análise de Sequência de DNA/instrumentação , Análise de Sequência de DNA/métodos
4.
Am J Phys Anthropol ; 151(1): 58-67, 2013 May.
Artigo em Inglês | MEDLINE | ID: mdl-23526367

RESUMO

The Maldives are an 850 km-long string of atolls located centrally in the northern Indian Ocean basin. Because of this geographic situation, the present-day Maldivian population has potential for uncovering genetic signatures of historic migration events in the region. We therefore studied autosomal DNA-, mitochondrial DNA-, and Y-chromosomal DNA markers in a representative sample of 141 unrelated Maldivians, with 119 from six major settlements. We found a total of 63 different mtDNA haplotypes that could be allocated to 29 mtDNA haplogroups, mostly within the M, R, and U clades. We found 66 different Y-STR haplotypes in 10 Y-chromosome haplogroups, predominantly H1, J2, L, R1a1a, and R2. Parental admixture analysis for mtDNA- and Y-haplogroup data indicates a strong genetic link between the Maldive Islands and mainland South Asia, and excludes significant gene flow from Southeast Asia. Paternal admixture from West Asia is detected, but cannot be distinguished from admixture from South Asia. Maternal admixture from West Asia is excluded. Within the Maldives, we find a subtle genetic substructure in all marker systems that is not directly related to geographic distance or linguistic dialect. We found reduced Y-STR diversity and reduced male-mediated gene flow between atolls, suggesting independent male founder effects for each atoll. Detected reduced female-mediated gene flow between atolls confirms a Maldives-specific history of matrilocality. In conclusion, our new genetic data agree with the commonly reported Maldivian ancestry in South Asia, but furthermore suggest multiple, independent immigration events and asymmetrical migration of females and males across the archipelago.


Assuntos
Povo Asiático/genética , Fluxo Gênico , Migração Humana/história , Cromossomos Humanos Y , DNA Mitocondrial/genética , Feminino , Marcadores Genéticos/genética , Genética Populacional , Haplótipos , História Antiga , Humanos , Ilhas do Oceano Índico , Masculino , Polimorfismo de Nucleotídeo Único
5.
Forensic Sci Int Genet ; 65: 102878, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37116245

RESUMO

Tobacco smoking is a frequent habit sustained by > 1.3 billion people in 2020 and the leading preventable factor for health risk and premature mortality worldwide. In the forensic context, predicting smoking habits from biological samples may allow broadening DNA phenotyping. In this study, we aimed to implement previously published smoking habit classification models based on blood DNA methylation at 13 CpGs. First, we developed a matching lab tool based on bisulfite conversion and multiplex PCR followed by amplification-free library preparation and targeted paired-end massively parallel sequencing (MPS). Analysis of six technical duplicates revealed high reproducibility of methylation measurements (Pearson correlation of 0.983). Artificially methylated standards uncovered marker-specific amplification bias, which we corrected via bi-exponential models. We then applied our MPS tool to 232 blood samples from Europeans of a wide age range, of which 90 were current, 71 former and 71 never smokers. On average, we obtained 189,000 reads/sample and 15,000 reads/CpG, without marker drop-out. Methylation distributions per smoking category roughly corresponded to previous microarray analysis, showcasing large inter-individual variation but with technology-driven bias. Methylation at 11 out of 13 smoking-CpGs correlated with daily cigarettes in current smokers, while solely one was weakly correlated with time since cessation in former smokers. Interestingly, eight smoking-CpGs correlated with age, and one displayed weak but significant sex-associated methylation differences. Using bias-uncorrected MPS data, smoking habits were relatively accurately predicted using both two- (current/non-current) and three- (never/former/current) category model, but bias correction resulted in worse prediction performance for both models. Finally, to account for technology-driven variation, we built new, joint models with inter-technology corrections, which resulted in improved prediction results for both models, with or without PCR bias correction (e.g. MPS cross-validation F1-score > 0.8; 2-categories). Overall, our novel assay takes us one step closer towards the forensic application of viable smoking habit prediction from blood traces. However, future research is needed towards forensically validating the assay, especially in terms of sensitivity. We also need to further shed light on the employed biomarkers, particularly on the mechanistics, tissue specificity and putative confounders of smoking epigenetic signatures.


Assuntos
Metilação de DNA , Fumar , Humanos , Reprodutibilidade dos Testes , Fumar/genética , Reação em Cadeia da Polimerase , Sequenciamento de Nucleotídeos em Larga Escala , Ilhas de CpG/genética
6.
Int J Legal Med ; 126(5): 781-9, 2012 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-22752809

RESUMO

The autosomal short tandem repeat (STR) kits that are currently used in forensic science have a high discrimination power. However, this discrimination power is sometimes not sufficient for complex kinship analyses or decreases when alleles are missing due to degradation of the DNA. The Investigator HDplex kit contains nine STRs that are additional to the commonly used forensic markers, and we validated this kit to assist human identification. With the increasing number of markers it becomes inevitable that forensic and kinship analyses include two or more STRs present on the same chromosome. To examine whether such markers can be regarded as independent, we evaluated the 30 STRs present in NGM, Identifiler and HDplex. Among these 30 markers, 17 syntenic STR pairs can be formed. Allelic association between these pairs was examined using 335 Dutch reference samples and no linkage disequilibrium was detected, which makes it possible to use the product rule for profile probability calculations in unrelated individuals. Linkage between syntenic STRs was studied by determining the recombination fraction between them in five three-generation CEPH families. The recombination fractions were compared to the physical and genetic distances between the markers. For most types of pedigrees, the kinship analyses can be performed using the product rule, and for those cases that require an alternative calculation method (Gill et al., Forensic Sci Int Genet 6:477-486, 2011), the recombination fractions as determined in this study can be used. Finally, we calculated the (combined) match probabilities, for the supplementary genotyping results of HDplex, NGM and Identifiler.


Assuntos
Alelos , Impressões Digitais de DNA/métodos , Genética Forense/métodos , Marcadores Genéticos/genética , Genética Populacional/métodos , Análise Heteroduplex/métodos , Repetições de Microssatélites/genética , Adulto , Idoso , Amelogenina/genética , Criança , Feminino , Frequência do Gene , Loci Gênicos/genética , Genótipo , Projeto HapMap , Humanos , Desequilíbrio de Ligação , Masculino , Países Baixos
7.
Forensic Sci Int Genet ; 61: 102768, 2022 11.
Artigo em Inglês | MEDLINE | ID: mdl-35994887

RESUMO

The introduction of massively parallel sequencing in forensic analysis has been facilitated with typing kits, analysis software and allele naming tools such as the ForenSeq DNA Signature Prep (DSP) kit, FDSTools and STRNaming respectively. Here we describe how FDSTools 2.0 with integrated and refined STRNaming nomenclature was validated for implementation under ISO 17025 accreditation for the ForenSeq DSP kit. Newly-added options result in efficient automatic allele calling for the majority of markers while specific settings are applied for 'novel' sequence variants to avoid the calling of remaining variable noise observed in samples sequenced with the ForenSeq DSP kit that seem to arise in the PCR. Genome-wide built-in reference data allows for greatly simplified configuration of allele naming for human targets.


Assuntos
Impressões Digitais de DNA , Repetições de Microssatélites , Humanos , Sequenciamento de Nucleotídeos em Larga Escala , Alelos , DNA , Análise de Sequência de DNA , Polimorfismo de Nucleotídeo Único
8.
Forensic Sci Int Genet ; 58: 102675, 2022 05.
Artigo em Inglês | MEDLINE | ID: mdl-35144074

RESUMO

The possibility of providing investigative leads when conventional DNA identification methods fail to solve a case can be of extreme relevance to law enforcement. Therefore, the forensic genetics community has focused research towards the broadened use of DNA, particularly for prediction of appearance traits, bio-geographical ancestry and age. The VISible Attributes through GEnomics (VISAGE) Consortium expanded the use of DNA phenotyping by developing new molecular and statistical tools for appearance, age and ancestry prediction. The VISAGE basic tool for appearance (EVC) and ancestry (BGA) prediction was initially developed using Ampliseq chemistry, but here is being evaluated using ForenSeq chemistry. The VISAGE basic tool offers a total of 41 EVC and 115 BGA SNPs and thus provides more predictions, i.e., skin color, than achieved with the ForenSeq DNA Signature Prep kit that is based on 24 EVC and 56 BGA SNPs. Five VISAGE laboratories participated in collaborative experiments to provide foreground for developmental validation of the assay. Assessment of assay performance and quality metrics, reproducibility, sensitivity, inhibitor tolerance and species specificity are described. Furthermore, the assay was tested using challenging samples such as mock casework samples and artificially degraded DNA. Two different analysis strategies were applied for this study and output on genotype calls and read depth was compared. Overall, inter-laboratory, inter-method and concordance with publicly available data were analysed and compared. Finally, the results showed a reliable and robust tool, which can be easily applied for laboratories already using a MiSeq FGx with ForenSeq reagents.


Assuntos
Impressões Digitais de DNA , Sequenciamento de Nucleotídeos em Larga Escala , Impressões Digitais de DNA/métodos , Genética Forense/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Polimorfismo de Nucleotídeo Único , Reprodutibilidade dos Testes , Análise de Sequência de DNA/métodos , Especificidade da Espécie
9.
Mol Biol Evol ; 27(2): 385-93, 2010 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-19822636

RESUMO

We have investigated human male demographic history using 590 males from 51 populations in the Human Genome Diversity Project - Centre d'Etude du Polymorphisme Humain worldwide panel, typed with 37 Y-chromosomal Single Nucleotide Polymorphisms and 65 Y-chromosomal Short Tandem Repeats and analyzed with the program Bayesian Analysis of Trees With Internal Node Generation. The general patterns we observe show a gradient from the oldest population time to the most recent common ancestors (TMRCAs) and expansion times together with the largest effective population sizes in Africa, to the youngest times and smallest effective population sizes in the Americas. These parameters are significantly negatively correlated with distance from East Africa, and the patterns are consistent with most other studies of human variation and history. In contrast, growth rate showed a weaker correlation in the opposite direction. Y-lineage diversity and TMRCA also decrease with distance from East Africa, supporting a model of expansion with serial founder events starting from this source. A number of individual populations diverge from these general patterns, including previously documented examples such as recent expansions of the Yoruba in Africa, Basques in Europe, and Yakut in Northern Asia. However, some unexpected demographic histories were also found, including low growth rates in the Hazara and Kalash from Pakistan and recent expansion of the Mozabites in North Africa.


Assuntos
Cromossomos Humanos Y/genética , Variação Genética/genética , Genética Populacional/métodos , Repetições de Microssatélites/genética , Polimorfismo de Nucleotídeo Único/genética , África do Norte , Evolução Molecular , Geografia , Humanos , Masculino , Filogenia
10.
Forensic Sci Int Genet ; 52: 102473, 2021 05.
Artigo em Inglês | MEDLINE | ID: mdl-33607395

RESUMO

The introduction of Massively Parallel Sequencing in the forensic domain has exposed the need for comprehensive nomenclature of sequenced Short Tandem Repeat (STR) alleles. In general, three strategies are at hand: 1) the full sequence mapped to the human genome reference sequence, which ensures exact data exchange; 2) shortened, human-readable formats for forensic reporting and data presentation and 3) very short codes that enable compact figures and tables but do not convey any sequence information. Here, we describe an algorithm of the second type: STRNaming, which generates human-readable names for sequenced STR alleles. STRNaming is guided by a reference sequence at each locus and then functions independently to automatically assign a unique, sequence-descriptive name that also includes the capillary electrophoresis allele number. STRNaming settings were established based on preferences that were surveyed internationally in the forensic community. These settings ensure that a small change in the sequence corresponds to a small change in the allele name, which is helpful for recognising for instance stutter products. Sequence variants outside of the repeat units are indicated as simple variant calls. Since the STR name is sequence-descriptive, the sequence can be traced back from the allele name. Because STRNaming is fully guided by an assignable reference sequence, no central coordination or configuration is required and the method will work for any STR locus, be it autosomal, Y-, X-chromosomal in current or future use. The algorithm is publicly available online and offline.


Assuntos
Algoritmos , Alelos , Repetições de Microssatélites , Impressões Digitais de DNA , Genoma Humano , Humanos , Análise de Sequência de DNA
11.
Forensic Sci Int Genet ; 52: 102489, 2021 05.
Artigo em Inglês | MEDLINE | ID: mdl-33677249

RESUMO

The interpretation of short tandem repeat (STR) profiles can be challenging when, for example, alleles are masked due to allele sharing among contributors and/or when they are subject to drop-out, for instance from sample degradation. Mixture interpretation can be improved by increasing the number of STRs and/or loci with a higher discriminatory power. Both capillary electrophoresis (CE, 6-dye) and massively parallel sequencing (MPS) provide a platform for analysing relatively large numbers of autosomal STRs. In addition, MPS enables distinguishing between sequence variants, resulting in enlarged discriminatory power. Also, MPS allows for small amplicon sizes for all loci as spacing is not an issue, which is beneficial with degraded DNA. Altogether, MPS has the potential to increase the weights of evidence for true contributors to (complex) DNA profiles. In this study, likelihood ratio (LR) calculations were performed using STR profiles obtained with two different MPS systems and analysed using different settings: 1) MPS PowerSeq™ Auto System profiles analysed using FDSTools equipped with optimized settings such as noise correction, 2) ForenSeq™ DNA Signature Prep Kit profiles analysed using the default settings in the Universal Analysis Software (UAS), and 3) ForenSeq™ DNA Signature Prep Kit profiles analysed using FDSTools empirically adapted to cope with one-directional reads and provisional, basic settings. The LR calculations used genotyping data for two- to four-person mixtures varying for mixture proportion, level of drop-out and allele sharing and were generated with the continuous model EuroForMix. The LR results for the over 2000 sets of propositions were affected by the variation for the number of markers and analysis settings used in the three approaches. Nevertheless, trends for true and non-contributors, effects of replicates, assigned number of contributors, and model validation results were comparable for the three MPS approaches and alike the trends known for CE data. Based on this analogy, we regard the probabilistic interpretation of MPS STR data fit for forensic DNA casework. In addition, guidelines were derived on when to apply LR calculations to MPS autosomal STR data and report the corresponding results.


Assuntos
Impressões Digitais de DNA , Sequenciamento de Nucleotídeos em Larga Escala , Funções Verossimilhança , Software , Alelos , Eletroforese Capilar , Genótipo , Humanos , Repetições de Microssatélites , Análise de Sequência de DNA
12.
Hum Mutat ; 31(12): E1875-93, 2010 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-20886636

RESUMO

The current U.S. population represents an amalgam of individuals originating mainly from four continental regions (Africa, Europe, Asia and America). To study the genetic ancestry and compare with self-declared ancestry we have analyzed paternally, maternally and bi-parentally inherited DNA markers sensitive for indicating continental genetic ancestry in all four major U.S. American groups. We found that self-declared U.S. Hispanics and U.S. African Americans tend to show variable degrees of continental genetic admixture among the three genetic systems, with evidence for a marked sex-biased admixture history. Moreover, for these two groups we observed significant regional variation across the country in genetic admixture. In contrast, self-declared U.S. European and U.S. Asian Americans were genetically more homogeneous at the continental ancestry level. Two autosomal ancestry-sensitive markers located in skin pigmentation candidate genes showed significant differences in self-declared U.S. African Americans or U.S. European Americans, relative to their assumed parental populations from Africa or Europe. This provides genetic support for the importance of skin color in the complex process of ancestry identification.


Assuntos
Cromossomos Humanos Y/genética , DNA Mitocondrial/genética , Genealogia e Heráldica , Linhagem , Grupos Raciais/genética , Negro ou Afro-Americano/genética , Povo Asiático/genética , Europa (Continente)/etnologia , Feminino , Marcadores Genéticos , Genoma Humano/genética , Hispânico ou Latino/genética , Humanos , Masculino , Estados Unidos
13.
Ann Hum Genet ; 74(1): 65-76, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-20059473

RESUMO

We investigated the bio-geographic ancestry of Argentineans, and quantified their genetic admixture, analyzing 246 unrelated male individuals from eight provinces of three Argentinean regions using ancestry-sensitive DNA markers (ASDM) from autosomal, Y and mitochondrial chromosomes. Our results demonstrate that European, Native American and African ancestry components were detectable in the contemporary Argentineans, the amounts depending on the genetic system applied, exhibiting large inter-individual heterogeneity. Argentineans carried a large fraction of European genetic heritage in their Y-chromosomal (94.1%) and autosomal (78.5%) DNA, but their mitochondrial gene pool is mostly of Native American ancestry (53.7%); instead, African heritage was small in all three genetic systems (<4%). Population substructure in Argentina considering the eight sampled provinces was very small based on autosomal (0.92% of total variation was between provincial groups, p = 0.005) and mtDNA (1.77%, p = 0.005) data (none with NRY data), and all three genetic systems revealed no substructure when clustering the provinces into the three geographic regions to which they belong. The complex genetic ancestry picture detected in Argentineans underscores the need to apply ASDM from all three genetic systems to infer geographic origins and genetic admixture. This applies to all worldwide areas where people with different continental ancestry live geographically close together.


Assuntos
Cromossomos Humanos Y , DNA Mitocondrial , Argentina , População Negra/genética , Cromossomos , Pool Gênico , Genética Populacional , Hispânico ou Latino/genética , Humanos , Indígenas Norte-Americanos/genética , Masculino , População Branca/genética
14.
Eur J Hum Genet ; 28(3): 287-299, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-31488894

RESUMO

Previous studies indicated existing, albeit limited, genetic-geographic population substructure in the Dutch population based on genome-wide data and a lack of this for mitochondrial SNP based data. Despite the aforementioned studies, Y-chromosomal SNP data from the Netherlands remain scarce and do not cover the territory of the Netherlands well enough to allow a reliable investigation of genetic-geographic population substructure. Here we provide the first substantial dataset of detailed spatial Y-chromosomal haplogroup information in 2085 males collected across the Netherlands and supplemented with previously published data from northern Belgium. We found Y-chromosomal evidence for genetic-geographic population substructure, and several Y-haplogroups demonstrating significant clinal frequency distributions in different directions. By means of prediction surface maps we could visualize (complex) distribution patterns of individual Y-haplogroups in detail. These results highlight the value of a micro-geographic approach and are of great use for forensic and epidemiological investigations and our understanding of the Dutch population history. Moreover, the previously noted absence of genetic-geographic population substructure in the Netherlands based on mitochondrial DNA in contrast to our Y-chromosome results, hints at different population histories for women and men in the Netherlands.


Assuntos
Cromossomos Humanos Y/genética , Polimorfismo de Nucleotídeo Único , População/genética , Haplótipos , Humanos , Masculino , Países Baixos
15.
Eur J Hum Genet ; 28(3): 399, 2020 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-31645767

RESUMO

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

16.
Forensic Sci Int Genet ; 43: 102152, 2019 11.
Artigo em Inglês | MEDLINE | ID: mdl-31518964

RESUMO

Forensic DNA Phenotyping (FDP) provides the ability to predict externally visible characteristics from minute amounts of crime scene DNA, which can help find unknown perpetrators who are typically unidentifiable via conventional forensic DNA profiling. Fundamental human genetics research has led to a better understanding of the specific DNA variants responsible for physical appearance characteristics, particularly eye, hair, and skin color. Recently, we introduced the HIrisPlex-S system for the simultaneous prediction of eye, hair, and skin color based on 41 DNA variants generated from two forensically validated SNaPshot multiplex assays using capillary electrophoresis (CE). Here we introduce massively parallel sequencing (MPS) solutions for the HIrisPlex-S (HPS) system on two MPS platforms commonly used in forensics, Ion Torrent and MiSeq, that cover all 41 DNA variants in a single assay, respectively. Additionally, we present the forensic developmental validation of the two HPS-MPS assays. The Ion Torrent MPS assay, based on Ion AmpliSeq technology, illustrated the successful generation of full HIrisPlex-S genotypic profiles from 100 pg of input control DNA, while the MiSeq MPS assay based on an in-house design yielded complete profiles from 250 pg of input DNA. Assessing simulated forensic casework samples such as saliva, hair (bulb), blood, semen, and low quantity touch DNA, as well as artificially damaged DNA samples, concordance testing, and samples from numerous species, all illustrated the ability of both versions of the HIrisPlex-S MPS assay to produce results that motivate forensic applications. By also providing an integrated bioinformatics analysis pipeline, MPS data can now be analyzed and a file generated for upload to the publically accessible HIrisPlex online webtool (https://hirisplex.erasmusmc.nl). In addition, we updated the website to accept VCF input data for those with genome sequence data. We thus provide a user-friendly and semi-automated MPS workflow from DNA sample to individual eye, hair, and skin color prediction probabilities. Furthermore, we present a 2-person mixture separation tool that not only assesses genotype reliability with regards genotyping confidence but also provides the most fitting mixture scenario for both minor and major contributors, including profile separation. We envision this MPS implementation of the HIrisPlex-S system for eye, hair, and skin color prediction from DNA as a starting point for further expanding MPS-based forensic DNA phenotyping. This may include the future addition of SNPs predictive for more externally visible characteristics, as well as SNPs for bio-geographic ancestry inference, provided the statistical framework for DNA prediction of these traits is in place.


Assuntos
Cor de Olho/genética , Técnicas de Genotipagem/instrumentação , Cor de Cabelo/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Polimorfismo de Nucleotídeo Único , Pigmentação da Pele/genética , Animais , DNA/genética , Genótipo , Humanos , Fenótipo , Reação em Cadeia da Polimerase , Especificidade da Espécie
17.
Forensic Sci Int Genet ; 35: 169-175, 2018 07.
Artigo em Inglês | MEDLINE | ID: mdl-29852469

RESUMO

Since two decades, short tandem repeats (STRs) are the preferred markers for human identification, routinely analysed by fragment length analysis. Here we present a novel set of short hypervariable autosomal microhaplotypes (MH) that have four or more SNPs in a span of less than 70 nucleotides (nt). These MHs display a discriminating power approaching that of STRs and provide a powerful alternative for the analysis;1;is of forensic samples that are problematic when the STR fragment size range exceeds the integrity range of severely degraded DNA or when multiple donors contribute to an evidentiary stain and STR stutter artefacts complicate profile interpretation. MH typing was developed using the power of massively parallel sequencing (MPS) enabling new powerful, fast and efficient SNP-based approaches. MH candidates were obtained from queries in data of the 1000 Genomes, and Genome of the Netherlands (GoNL) projects. Wet-lab analysis of 276 globally dispersed samples and 97 samples of nine large CEPH families assisted locus selection and corroboration of informative value. We infer that MHs represent an alternative marker type with good discriminating power per locus (allowing the use of a limited number of loci), small amplicon sizes and absence of stutter artefacts that can be especially helpful when unbalanced mixed samples are submitted for human identification.


Assuntos
Impressões Digitais de DNA/métodos , Haplótipos , Polimorfismo de Nucleotídeo Único , Alelos , Artefatos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Reação em Cadeia da Polimerase Multiplex , Análise de Sequência de DNA
18.
Forensic Sci Int Genet ; 37: 241-251, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30268682

RESUMO

Human head hair shape, commonly classified as straight, wavy, curly or frizzy, is an attractive target for Forensic DNA Phenotyping and other applications of human appearance prediction from DNA such as in paleogenetics. The genetic knowledge underlying head hair shape variation was recently improved by the outcome of a series of genome-wide association and replication studies in a total of 26,964 subjects, highlighting 12 loci of which 8 were novel and introducing a prediction model for Europeans based on 14 SNPs. In the present study, we evaluated the capacity of DNA-based head hair shape prediction by investigating an extended set of candidate SNP predictors and by using an independent set of samples for model validation. Prediction model building was carried out in 9674 subjects (6068 from Europe, 2899 from Asia and 707 of admixed European and Asian ancestries), used previously, by considering a novel list of 90 candidate SNPs. For model validation, genotype and phenotype data were newly collected in 2415 independent subjects (2138 Europeans and 277 non-Europeans) by applying two targeted massively parallel sequencing platforms, Ion Torrent PGM and MiSeq, or the MassARRAY platform. A binomial model was developed to predict straight vs. non-straight hair based on 32 SNPs from 26 genetic loci we identified as significantly contributing to the model. This model achieved prediction accuracies, expressed as AUC, of 0.664 in Europeans and 0.789 in non-Europeans; the statistically significant difference was explained mostly by the effect of one EDAR SNP in non-Europeans. Considering sex and age, in addition to the SNPs, slightly and insignificantly increased the prediction accuracies (AUC of 0.680 and 0.800, respectively). Based on the sample size and candidate DNA markers investigated, this study provides the most robust, validated, and accurate statistical prediction models and SNP predictor marker sets currently available for predicting head hair shape from DNA, providing the next step towards broadening Forensic DNA Phenotyping beyond pigmentation traits.


Assuntos
DNA/genética , Cabelo , Fenótipo , Polimorfismo de Nucleotídeo Único , Adulto , Estudo de Associação Genômica Ampla , Técnicas de Genotipagem/instrumentação , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Modelos Logísticos , Modelos Genéticos , Análise de Sequência de DNA
19.
Forensic Sci Int Genet ; 27: 27-40, 2017 03.
Artigo em Inglês | MEDLINE | ID: mdl-27914278

RESUMO

Massively parallel sequencing (MPS) is on the advent of a broad scale application in forensic research and casework. The improved capabilities to analyse evidentiary traces representing unbalanced mixtures is often mentioned as one of the major advantages of this technique. However, most of the available software packages that analyse forensic short tandem repeat (STR) sequencing data are not well suited for high throughput analysis of such mixed traces. The largest challenge is the presence of stutter artefacts in STR amplifications, which are not readily discerned from minor contributions. FDSTools is an open-source software solution developed for this purpose. The level of stutter formation is influenced by various aspects of the sequence, such as the length of the longest uninterrupted stretch occurring in an STR. When MPS is used, STRs are evaluated as sequence variants that each have particular stutter characteristics which can be precisely determined. FDSTools uses a database of reference samples to determine stutter and other systemic PCR or sequencing artefacts for each individual allele. In addition, stutter models are created for each repeating element in order to predict stutter artefacts for alleles that are not included in the reference set. This information is subsequently used to recognise and compensate for the noise in a sequence profile. The result is a better representation of the true composition of a sample. Using Promega Powerseq™ Auto System data from 450 reference samples and 31 two-person mixtures, we show that the FDSTools correction module decreases stutter ratios above 20% to below 3%. Consequently, much lower levels of contributions in the mixed traces are detected. FDSTools contains modules to visualise the data in an interactive format allowing users to filter data with their own preferred thresholds.


Assuntos
Artefatos , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites , Software , Alelos , Amelogenina/genética , Impressões Digitais de DNA , Humanos , Reação em Cadeia da Polimerase
20.
Forensic Sci Int Genet ; 30: 66-70, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28633070

RESUMO

We have genotyped the 58 STRs (27 autosomal, 24 Y-STRs and 7 X-STRs) and 94 autosomal SNPs in Illumina ForenSeq™ Primer Mix A in 88 Spanish Roma (Gypsy) samples and 143 Catalans. Since this platform is based in massive parallel sequencing, we have used simple R scripts to uncover the sequence variation in the repeat region. Thus, we have found, across 58 STRs, 541 length-based alleles, which, after considering repeat-sequence variation, became 804 different alleles. All loci in both populations were in Hardy-Weinberg equilibrium. FST between both populations was 0.0178 for autosomal SNPs, 0.0146 for autosomal STRs, 0.0101 for X-STRs and 0.1866 for Y-STRs. Combined a priori statistics showed quite large; for instance, pooling all the autosomal loci, the a priori probabilities of discriminating a suspect become 1-(2.3×10-70) and 1-(5.9×10-73), for Roma and Catalans respectively, and the chances of excluding a false father in a trio are 1-(2.6×10-20) and 1-(2.0×10-21).


Assuntos
Etnicidade/genética , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único , Alelos , Cromossomos Humanos X , Cromossomos Humanos Y , Feminino , Genética Populacional , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Espanha
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA