RESUMO
We present results from an inter-laboratory massively parallel sequencing (MPS) study in the framework of the SeqForSTRs project to evaluate forensically relevant parameters, such as performance, concordance, and sensitivity, using a standardized sequencing library including reference material, mixtures, and ancient DNA samples. The standardized library was prepared using the ForenSeq DNA Signature Prep Kit (primer mix A). The library was shared between eight European laboratories located in Austria, France, Germany, The Netherlands, and Sweden to perform MPS on their particular MiSeq FGx sequencers. Despite variation in performance between sequencing runs, all laboratories obtained quality metrics that fell within the manufacturer's recommended ranges. Furthermore, differences in locus coverage did not inevitably adversely affect heterozygous balance. Inter-laboratory concordance showed 100% concordant genotypes for the included autosomal and Y-STRs, and still, X-STR concordance exceeded 83%. The exclusive reasons for X-STR discordances were drop-outs at DXS10103. Sensitivity experiments demonstrated that correct allele calling varied between sequencing instruments in particular for lower DNA amounts (≤ 125 pg). The analysis of compromised DNA samples showed the drop-out of one sample (FA10013B01A) while for the remaining three degraded DNA samples MPS was able to successfully type ≥ 87% of all aSTRs, ≥ 78% of all Y-STRs, ≥ 68% of all X-STRs, and ≥ 92% of all iSNPs demonstrating that MPS is a promising tool for human identity testing, which in return, has to undergo rigorous in-house validation before it can be implemented into forensic routine casework.
Assuntos
Impressões Digitais de DNA/métodos , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA , Alelos , Áustria , Eletroforese Capilar , Feminino , França , Alemanha , Humanos , Laboratórios , Masculino , Países Baixos , Reação em Cadeia da Polimerase , Sensibilidade e Especificidade , SuéciaRESUMO
The current state of validation and implementation strategies of massively parallel sequencing (MPS) technology for the analysis of STR markers for forensic genetics use is described, covering the topics of the current catalog of commercial MPS-STR panels, leading MPS-platforms, and MPS-STR data analysis tools. In addition, the developmental and internal validation studies carried out to date to evaluate reliability, sensitivity, mixture analysis, concordance, and the ability to analyze challenged samples are summarized. The results of various MPS-STR population studies that showed a large number of new STR sequence variants that increase the power of discrimination in several forensically relevant loci are also presented. Finally, various initiatives developed by several international projects and standardization (or guidelines) groups to facilitate application of MPS technology for STR marker analyses are discussed in regard to promoting a standard STR sequence nomenclature, performing population studies to detect sequence variants, and developing a universal system to translate sequence variants into a simple STR nomenclature (numbers and letters) compatible with national STR databases.
Assuntos
DNA/genética , Genética Forense/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Alelos , Impressões Digitais de DNA/métodos , Bases de Dados de Ácidos Nucleicos , Genótipo , Humanos , Repetições de MicrossatélitesRESUMO
China has repeatedly been the subject of genetic studies to elucidate its prehistoric and historic demography. While some studies reported a genetic distinction between Northern and Southern Han Chinese, others showed a more clinal picture of small differences within China. Here, we investigated the distribution of Y chromosome variation along administrative as well as ethnic divisions in the mainland territory of the People's Republic of China, including 28 administrative regions and 19 recognized Chinese nationalities, to assess the impact of recent demographic processes. To this end, we analyzed 37,994 Y chromosomal 17-marker haplotype profiles from the YHRD database with respect to forensic diversity measures and genetic distance between groups defined by administrative boundaries and ethnic origin. We observed high diversity throughout all Chinese provinces and ethnicities. Some ethnicities, including most prominently Kazakhs and Tibetans, showed significant genetic differentiation from the Han and other groups. However, differences between provinces were, except for those located on the Tibetan plateau, less pronounced. This discrepancy is explicable by the sizeable presence of Han speakers, who showed high genetic homogeneity all across China, in nearly all studied provinces. Furthermore, we observed a continuous genetic North-South gradient in the Han, confirming previous reports of a clinal distribution of Y chromosome variation and being in notable concordance with the previously observed spatial distribution of autosomal variation. Our findings shed light on the demographic changes in China accrued by a fast-growing and increasingly mobile population.
Assuntos
Povo Asiático/genética , Cromossomos Humanos Y/genética , Haplótipos , China , Variação Genética , Genética Populacional , Técnicas de Genotipagem , Humanos , Masculino , Repetições de MicrossatélitesRESUMO
Numerous studies of human populations in Europe and Asia have revealed a concordance between their extant genetic structure and the prevailing regional pattern of geography and language. For native South Americans, however, such evidence has been lacking so far. Therefore, we examined the relationship between Y-chromosomal genotype on the one hand, and male geographic origin and linguistic affiliation on the other, in the largest study of South American natives to date in terms of sampled individuals and populations. A total of 1,011 individuals, representing 50 tribal populations from 81 settlements, were genotyped for up to 17 short tandem repeat (STR) markers and 16 single nucleotide polymorphisms (Y-SNPs), the latter resolving phylogenetic lineages Q and C. Virtually no structure became apparent for the extant Y-chromosomal genetic variation of South American males that could sensibly be related to their inter-tribal geographic and linguistic relationships. This continent-wide decoupling is consistent with a rapid peopling of the continent followed by long periods of isolation in small groups. Furthermore, for the first time, we identified a distinct geographical cluster of Y-SNP lineages C-M217 (C3*) in South America. Such haplotypes are virtually absent from North and Central America, but occur at high frequency in Asia. Together with the locally confined Y-STR autocorrelation observed in our study as a whole, the available data therefore suggest a late introduction of C3* into South America no more than 6,000 years ago, perhaps via coastal or trans-Pacific routes. Extensive simulations revealed that the observed lack of haplogroup C3* among extant North and Central American natives is only compatible with low levels of migration between the ancestor populations of C3* carriers and non-carriers. In summary, our data highlight the fact that a pronounced correlation between genetic and geographic/cultural structure can only be expected under very specific conditions, most of which are likely not to have been met by the ancestors of native South Americans.
Assuntos
Cromossomos Humanos Y/genética , Haplótipos/genética , Indígenas Sul-Americanos/genética , Repetições de Microssatélites/genética , América Central , Europa (Continente) , Genótipo , Geografia , Humanos , Idioma , Linguística , Masculino , Filogenia , Polimorfismo de Nucleotídeo Único , Grupos Populacionais/genética , América do SulRESUMO
To evaluate the promising advantages of massively parallel sequencing (MPS) in our casework, we analysed a total of 33 Y-chromosomal short tandem repeats (Y-STRs) with traditional capillary electrophoresis (CE) and 25 Y-STRs using the newer MPS technology. We studied the outcome of both technologies in 64 father-son pairs using stock and custom-designed kits. Current MPS technology confirmed the 13 mutational events observed with CE and improved our understanding of the complex nature of STR mutations. By detecting isometric sequence variants between unrelated males, we show that sequencing Y-STRs using MPS can boost discrimination power.
RESUMO
The distribution of Y-chromosomal short tandem repeat (Y-STR) haplotypes was determined in a population of Taiwanese Paiwan aboriginals. Using 17 Y-STR markers, a total of 135 haplotypes were observed, 102 of which were unique. The overall haplotype diversity for the 17 Y-STR loci tested was 0.9922 and the discrimination capacity was 0.6490. In addition, three novel intermediate alleles at the DYS448 locus were also found.
Assuntos
Cromossomos Humanos Y , Impressões Digitais de DNA , Etnicidade/genética , Haplótipos , Sequências de Repetição em Tandem , Frequência do Gene , Humanos , Masculino , Reação em Cadeia da Polimerase , TaiwanRESUMO
Forensic genetic laboratories perform a large amount of STR analyses of the Y chromosome, in particular to analyze the male part of complex DNA mixtures. However, the statistical interpretation of evidence retrieved from Y-STR haplotypes is challenging. Due to the uni-parental inheritance mode, Y-STR loci are connected to each other and thus haplotypes show patterns of relationship on the familial and population level. This precludes the treatment of Y-STR loci as independently inherited variables and the application of the product rule. Instead, the dependency structure of Y-STRs needs to be included in the haplotype frequency estimation process affecting also the current paradigm of a random match probability that is in the autosomal case approximated by the population frequency assuming unrelatedness of sampled individuals. Information on the degree of paternal relatedness in the suspect population as well as on the familial network is however needed to interpret Y-chromosomal results in the best possible way. The previous recommendations of the DNA commission of the ISFG on the use of Y-STRs in forensic analysis published more than a decade ago [1] cover the interpretation issue only marginally. The current recommendations address a number of topics (frequency estimators, databases, metapopulations, LR formulation, triage, rapidly mutating Y-STRs) with relevance for the Y-STR statistics and recommend a decision-based procedure, which takes into account legal requirements as well as availability of population data and statistical methods.
Assuntos
Cromossomos Humanos Y , Impressões Digitais de DNA/normas , Genética Forense/normas , Repetições de Microssatélites , Alelos , Bases de Dados Genéticas/normas , Genética Populacional , Haplótipos , Humanos , Modelos EstatísticosRESUMO
A total of 314 individuals representing the three major ethno-linguistic groups (ethnic Macedonians, Albanians and Turks) in the Republic of North Macedonia were analyzed for Y-SNPs and Y-STRs using minisequencing and fragment analysis. The haplogroup composition differed remarkably between the three groups with dominance of haplogroup I2 in ethnic Macedonians (28.1%), E1b in Albanians (35.3%) and J2a (34.9%) in Turks, respectively. The haplotype analysis using the YFilerPlus kit disclosed a significant reduction in diversity values (DC, GD) for the Turkish subgroup compared to the Macedonian and Albanian speaking populations. The Y-STR based population analysis revealed a similarity of ethnic Macedonians with neighboring Serbians and Bulgarians. The same holds true for the Albanian speakers from Macedonia and Albania, whereas the Turkish minority in North Macedonia stands apart from the population in Turkey.
Assuntos
Cromossomos Humanos Y , Etnicidade/genética , Genética Populacional , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único , Impressões Digitais de DNA , Eletroforese Capilar , Haplótipos , Humanos , Masculino , Filogenia , Reação em Cadeia da Polimerase , República da Macedônia do NorteRESUMO
Previous studies of Y chromosome variation have revealed that western Europe, the Volga-Ural region, and the Caucasus differ dramatically with respect to Y-SNP haplogroup composition. The European part of Russia is situated in between these three regions; to determine if these differences reflect clines or boundaries in the Y-chromosome landscape, we analyzed 12 Y-SNPs in 545 males from 12 populations from the European part of Russia. The majority of Russian Y chromosomes (from 74% to 94%) belong to three Y chromosomal lineages [I-M170, R1a1-M17, and N3-TAT] that are also frequent in the rest of east Europe, north Europe, and/or in the Volga-Ural region. We find significant but low correlations between haplogroup frequencies and the geographic location of populations, suggesting gradual change in the Y chromosome gene pool across western Eurasia. However, we also find some significant boundaries between populations, suggesting that both isolation and migration have influenced the Y chromosome landscape.
Assuntos
Cromossomos Humanos Y/genética , Variação Genética , Genética Populacional , População Branca/genética , Heterogeneidade Genética , Haplótipos , Humanos , Masculino , Polimorfismo de Nucleotídeo Único , Federação RussaRESUMO
The implementation of massively parallel sequencing (MPS) in forensic science revealed the advantages of the new method compared to the forensic benchmark in DNA-STR analysis, the capillary-electrophoresis (CE): Sequence information and the possibility to multiplex hundreds of markers in one multiplex PCR increase the discrimination power of a forensic (STR-) profile. The EU funded project DNASeqEx (DNA-STR Massive Sequencing & International Information Exchange) aims to evaluate MPS-based materials in their respective developmental stages using the two established platforms MiSeq FGx (Illumina) and Ion S5™ (Thermo Fisher Scientific). As part of this project, we present here an inter-laboratory validation of the Forenseq™ DNA Signature Prep Kit, focussing on STRs included in primer mix A. Our study comprises tests of concordance, reproducibility, sensitivity (1â¯ng, 500â¯pg, 250â¯pg, 125â¯pg, 63â¯pg, 31â¯pg) and mixtures (male-male and male-female at ratios of 1:1, 1:5, 1:10, 1:15, 1:20, 1:100, 1:500, 1:1000). Sequencing results found to be virtually concordant to CE results, to reference profiles and reproducible between duplicates and between both laboratories. We observed first locus drop-outs (LDO) at a DNA input of 63â¯pg (20 sample pool) and 125â¯pg (38 sample pool). Alleles were found to be well balanced at a DNA input of 250â¯pg or more. We found the kit to perform well on moderate mixtures (1:1-1:20).
Assuntos
Genética Forense/instrumentação , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA , Alelos , Impressões Digitais de DNA , Feminino , Humanos , Masculino , Reação em Cadeia da Polimerase , Reprodutibilidade dos TestesRESUMO
The DNA Commission of the International Society for Forensic Genetics (ISFG) is reviewing factors that need to be considered ahead of the adoption by the forensic community of short tandem repeat (STR) genotyping by massively parallel sequencing (MPS) technologies. MPS produces sequence data that provide a precise description of the repeat allele structure of a STR marker and variants that may reside in the flanking areas of the repeat region. When a STR contains a complex arrangement of repeat motifs, the level of genetic polymorphism revealed by the sequence data can increase substantially. As repeat structures can be complex and include substitutions, insertions, deletions, variable tandem repeat arrangements of multiple nucleotide motifs, and flanking region SNPs, established capillary electrophoresis (CE) allele descriptions must be supplemented by a new system of STR allele nomenclature, which retains backward compatibility with the CE data that currently populate national DNA databases and that will continue to be produced for the coming years. Thus, there is a pressing need to produce a standardized framework for describing complex sequences that enable comparison with currently used repeat allele nomenclature derived from conventional CE systems. It is important to discern three levels of information in hierarchical order (i) the sequence, (ii) the alignment, and (iii) the nomenclature of STR sequence data. We propose a sequence (text) string format the minimal requirement of data storage that laboratories should follow when adopting MPS of STRs. We further discuss the variant annotation and sequence comparison framework necessary to maintain compatibility among established and future data. This system must be easy to use and interpret by the DNA specialist, based on a universally accessible genome assembly, and in place before the uptake of MPS by the general forensic community starts to generate sequence data on a large scale. While the established nomenclature for CE-based STR analysis will remain unchanged in the future, the nomenclature of sequence-based STR genotypes will need to follow updated rules and be generated by expert systems that translate MPS sequences to match CE conventions in order to guarantee compatibility between the different generations of STR data.
Assuntos
Genética Forense/métodos , Genética Forense/normas , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Terminologia como Assunto , DNA/genética , Bases de Dados de Ácidos Nucleicos/normas , Genótipo , Humanos , Repetições de Microssatélites/genética , Polimorfismo GenéticoRESUMO
Eleven Y-chromosomal short tandem repeats (STRs), DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385, DYS437, DYS438, DYS439 were typed in DNA samples from Macedonian population (n = 150).
Assuntos
Cromossomos Humanos Y , Genética Populacional , Haplótipos , Sequências de Repetição em Tandem , Impressões Digitais de DNA/métodos , Frequência do Gene , Humanos , Masculino , Reação em Cadeia da Polimerase , República da Macedônia do NorteRESUMO
After opening the first version of an internet-accessible worldwide reference database of Y chromosome profiles 14 years ago and six years after the last major relaunch the new YHRD 4.0 repository and website has been rolled-out. By November 2014 about 136k 9-locus haplotypes, among these 84k 17-locus haplotypes, 25k 23-locus haplotypes and 15k Y SNP profiles from 917 sampling locations in 128 countries have been submitted by more than 250 institutes and laboratories. In geographic terms, about 39% of the YHRD samples are from Europe, 32% from Asia, 16% from South America, 6% from North America, 4% from Africa and 2% from Oceania/Australia. Worldwide collaboration is the driving force for the rapid growth of the database and this, in turn, allows the evaluation and implementation of enhanced interpretation tools (variable frequency estimators, LR-based mixture and kinship analysis, Y-SNP-based ancestry assessment).
Assuntos
Cromossomos Humanos Y , Bases de Dados Genéticas , Haplótipos , Humanos , InternetRESUMO
Match probability calculation is deemed much more intricate for lineage genetic markers, including Y-chromosomal short tandem repeats (Y-STRs), than for autosomal markers. This is because, owing to the lack of recombination, strong interdependence between markers is likely, which implies that haplotype frequency estimates cannot simply be obtained through the multiplication of allele frequency estimates. As yet, however, the practical relevance of this problem has not been studied in much detail using real data. In fact, such scrutiny appears well warranted because the high mutation rates of Y-STRs and the possibility of backward mutation should have worked against the statistical association of Y-STRs. We examined haplotype data of 21 markers included in the PowerPlex(®)Y23 set (PPY23, Promega Corporation, Madison, WI) originating from six different populations (four European and two Asian). Assessing the conditional entropies of the markers, given different subsets of markers from the same panel, we demonstrate that the PowerPlex(®)Y23 set cannot be decomposed into smaller marker subsets that would be (conditionally) independent. Nevertheless, in all six populations, >94% of the joint entropy of the 21 markers is explained by the seven most rapidly mutating markers. Although this result might render a reduction in marker number a sensible option for practical casework, the partial haplotypes would still be almost as diverse as the full haplotypes. Therefore, match probability calculation remains difficult and calls for the improvement of currently available methods of haplotype frequency estimation.
Assuntos
Cromossomos Humanos Y , Repetições de Microssatélites/genética , Haplótipos , Humanos , ProbabilidadeRESUMO
For several years Y-chromosomal microsatellites (short tandem repeats, STRs) have been well established in forensic practice. In this context, the genetic characteristics of the Y chromosome (i.e. its paternal inheritance and lack of recombination) render STRs particularly powerful. However, genetic differences between male populations appear to be larger for Y-STRs than for autosomal STRs, a fact that is most likely due to the higher sensitivity of Y-chromosomal lineages to genetic drift (Forensic Sci Int 118 (2001) 153). The assessment of probabilities for matches between haplotyped male persons or traces/persons requires the typing of a large number of haplotypes in the appropriate reference populations. The haplotype data of a large number of European as well as South and North American populations have been collected and are continuously published online (Y-STR Haplotype Reference Database--YHRD; http://www.ystr.org). The most recent multicentric effort has led to the establishment of an Asian YHRD (http://www.ystr.org/asia) which has been available since January 2002. All databases are maintained and curated at the Institute of Legal Medicine, Humboldt-University, Berlin and will soon be fused to a global repository including populations from all continents.
Assuntos
Povo Asiático/genética , Cromossomos Humanos Y , Bases de Dados Factuais , Genética Populacional , Haplótipos , Sequências de Repetição em Tandem , Ásia , Impressões Digitais de DNA , Marcadores Genéticos , Humanos , InternetRESUMO
We describe here an online Y-chromosomal short tandem repeat haplotype reference database (YHRD) for U.S. populations, which represents 9-locus Y-STR haplotypes for 1705 African-Americans, European-Americans and Hispanics as of October 2001. This database is available online (http://www.ystr. org/usa/), free to access and was generated in order to supply the U.S. forensic DNA community with a valuable resource for frequencies of complete or incomplete 9-locus Y-STR haplotypes, as well as information about typing protocols and population genetic analyses. Pairwise R(ST)-statistics derived from the Y-STR haplotypes indicate no significant substructure among African-American populations from different regions of the U.S., nor (usually) among European-American and Hispanic populations. Thus, pooling of Y-STR haplotype data from regional populations within these three major groups is appropriate in order to obtain larger sample sizes. However, pooling of different major populations is generally not recommended due to statistically significant differences between African-American populations and all European-American/Hispanic populations, as well as between some European-American and Hispanic populations.
Assuntos
Bases de Dados Genéticas , Haplótipos/genética , Sequências de Repetição em Tandem , Cromossomo Y/genética , População Negra/genética , DNA/química , DNA/genética , Variação Genética , Hispânico ou Latino/genética , Humanos , Masculino , Padrões de Referência , Estados Unidos , População Branca/genética , Cromossomo Y/químicaRESUMO
In a worldwide collaborative effort, 19,630 Y-chromosomes were sampled from 129 different populations in 51 countries. These chromosomes were typed for 23 short-tandem repeat (STR) loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385ab, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, GATAH4, DYS481, DYS533, DYS549, DYS570, DYS576, and DYS643) and using the PowerPlex Y23 System (PPY23, Promega Corporation, Madison, WI). Locus-specific allelic spectra of these markers were determined and a consistently high level of allelic diversity was observed. A considerable number of null, duplicate and off-ladder alleles were revealed. Standard single-locus and haplotype-based parameters were calculated and compared between subsets of Y-STR markers established for forensic casework. The PPY23 marker set provides substantially stronger discriminatory power than other available kits but at the same time reveals the same general patterns of population structure as other marker sets. A strong correlation was observed between the number of Y-STRs included in a marker set and some of the forensic parameters under study. Interestingly a weak but consistent trend toward smaller genetic distances resulting from larger numbers of markers became apparent.
Assuntos
Cromossomos Humanos Y , Haplótipos , Repetições de Microssatélites , Alelos , Genética Forense , HumanosRESUMO
Estimation of match probabilities for singleton haplotypes of lineage markers, i.e. for haplotypes observed only once in a reference database augmented by a suspect profile, is an important problem in forensic genetics. We compared the performance of four estimators of singleton match probabilities for Y-STRs, namely the count estimate, both with and without Brenner's so-called 'kappa correction', the surveying estimate, and a previously proposed, but rarely used, coalescent-based approach implemented in the BATWING software. Extensive simulation with BATWING of the underlying population history, haplotype evolution and subsequent database sampling revealed that the coalescent-based approach is characterized by lower bias and lower mean squared error than the uncorrected count estimator and the surveying estimator. Moreover, in contrast to the two count estimators, both the surveying and the coalescent-based approach exhibited a good correlation between the estimated and true match probabilities. However, although its overall performance is thus better than that of any other recognized method, the coalescent-based estimator is still computation-intense on the verge of general impracticability. Its application in forensic practice therefore will have to be limited to small reference databases, or to isolated cases of particular interest, until more powerful algorithms for coalescent simulation have become available.
Assuntos
Cromossomos Humanos Y , Haplótipos , Repetições de Microssatélites/genética , Probabilidade , Humanos , Modelos GenéticosRESUMO
Jörg Jenatsch was a Swiss defender of independence and a fighter for liberty in the 17th century. With the help of three living male members of the Jenatsch family, we successfully identified a skeleton exhumed from Chur cathedral as the remains of Jörg Jenatsch. Our conclusion was based upon complete Y-STR and Y-SNP profiles that could be generated by replicate analyses of a bone sample available to us. The skeleton and the three living family members carried the same Y-SNP haplogroup, but were discordant at three of 23 Y-STR loci. This notwithstanding, conservative biostatistical evaluation of the data suggests that the Chur skeleton is at least 20 times more likely than not to be Jörg Jenatsch.