RESUMO
BACKGROUND: As a population genetic tool, mitochondrial DNA is commonly divided into the ~ 1-kb control region (CR), in which single nucleotide variant (SNV) diversity is relatively high, and the coding region, in which selective constraint is greater and diversity lower, but which provides an informative phylogeny. In some species, the CR contains variable tandemly repeated sequences that are understudied due to heteroplasmy. Domestic cats (Felis catus) have a recent origin and therefore traditional CR-based analysis of populations yields only a small number of haplotypes. RESULTS: To increase resolution we used Nanopore sequencing to analyse 119 cat mitogenomes via a long-amplicon approach. This greatly improves discrimination (from 15 to 87 distinct haplotypes in our dataset) and defines a phylogeny showing similar starlike topologies within all major clades (haplogroups), likely reflecting post-domestication expansion. We sequenced RS2, a CR tandem array of 80-bp repeat units, placing RS2 array structures within the phylogeny and increasing overall haplotype diversity. Repeat number varies between 3 and 12 (median: 4) with over 30 different repeat unit types differing largely by SNVs. Five SNVs show evidence of independent recurrence within the phylogeny, and seven are involved in at least 11 instances of rapid spread along repeat arrays within haplogroups. CONCLUSIONS: In defining mitogenome variation our study provides key information for the forensic genetic analysis of cat hair evidence, and for the first time a phylogenetically informed picture of tandem repeat variation that reveals remarkably dynamic mutation processes at work in the mitochondrion.
Assuntos
Genoma Mitocondrial , Gatos/genética , Animais , Variação Genética , Repetições Minissatélites/genética , Mitocôndrias , MutaçãoRESUMO
The human X and Y chromosomes are heteromorphic but share a region of homology at the tips of their short arms, pseudoautosomal region 1 (PAR1), that supports obligate crossover in male meiosis. Although the boundary between pseudoautosomal and sex-specific DNA has traditionally been regarded as conserved among primates, it was recently discovered that the boundary position varies among human males, due to a translocation of ~110 kb from the X to the Y chromosome that creates an extended PAR1 (ePAR). This event has occurred at least twice in human evolution. So far, only limited evidence has been presented to suggest this extension is recombinationally active. Here, we sought direct proof by examining thousands of gametes from each of two ePAR-carrying men, for two subregions chosen on the basis of previously published male X-chromosomal meiotic double-strand break (DSB) maps. Crossover activity comparable to that seen at autosomal hotspots was observed between the X and the ePAR borne on the Y chromosome both at a distal and a proximal site within the 110-kb extension. Other hallmarks of classic recombination hotspots included evidence of transmission distortion and GC-biased gene conversion. We observed good correspondence between the male DSB clusters and historical recombination activity of this region in the X chromosomes of females, as ascertained from linkage disequilibrium analysis; this suggests that this region is similarly primed for crossover in both male and female germlines, although sex-specific differences may also exist. Extensive resequencing and inference of ePAR haplotypes, placed in the framework of the Y phylogeny as ascertained by both Y microsatellites and single nucleotide polymorphisms, allowed us to estimate a minimum rate of crossover over the entire ePAR region of 6-fold greater than genome average, comparable with pedigree estimates of PAR1 activity generally. We conclude ePAR very likely contributes to the critical crossover function of PAR1.
Assuntos
Troca Genética/genética , Regiões Pseudoautossômicas/genética , Adulto , Mapeamento Cromossômico/métodos , Cromossomos , Cromossomos Humanos X/genética , Cromossomos Humanos Y/genética , Quebras de DNA de Cadeia Dupla , Ligação Genética , Genoma , Haplótipos , Humanos , Masculino , Polimorfismo de Nucleotídeo Único/genética , Recombinação Genética/genética , Espermatozoides/citologiaRESUMO
We have determined the distribution of Y-chromosomal haplotypes and predicted haplogroups in the ethnically diverse Kingdom of Bahrain, a small archipelago in the Arabian Gulf. Paternal population structure within Bahrain was investigated using the 27 Y-STRs (short tandem repeats) in the Yfiler Plus kit to generate haplotypes from 562 unrelated Bahraini males, sub-divided into four geographical regions-Northern, Capital, Southern and Muharraq. Yfiler Plus provided a significant improvement over the 17-locus Yfiler kit in discrimination capacity (from 77% to 87.5% overall), but discrimination capacity differed widely between regions from 98.4% in Muharraq to 75.2% in the Northern region, an unusually low value possibly resulting from recent rapid population expansion. Clusters of closely related male lineages were seen, with only 79.4% of donors displaying unique haplotypes and 59% of instances of shared haplotypes occurring within, rather than between, regions. Haplogroup prediction indicated diverse origins of the population with a predominance of haplogroups J2 and J1, both typical of the Arabian Peninsula, but also haplogroups such as B2 and E1b1a likely originating in Africa, and H, L and R2 likely indicative of migration from South Asia. Haplogroup frequencies differed significantly between regions, with J2 significantly more common in the Northern region compared with the Southern, possibly due to differential settlement by Baharna and Arabs. Our study shows that paternal lineage population structure can exist even over small geographical scales, and that highly discriminating genetic tools are required where rapid expansions have occurred within tightly bounded populations.
Assuntos
Cromossomos Humanos Y/genética , Etnicidade/genética , Variação Genética , Genética Populacional , Haplótipos , Repetições de Microssatélites , Adulto , Barein , Humanos , Masculino , Pessoa de Meia-Idade , Adulto JovemRESUMO
In developed countries, DNA profiling routinely forms part of the forensic strategy in the investigation of sexual violence. Medical examinations provide opportunities for recovering DNA evidence from intimate swabs, which can be particularly probative in cases where the identity of the perpetrator is unknown and proof of intercourse between two people is required. In low-resource environments, such as developing countries, remote geographic locations, conflict (and post-conflict) affected regions and displaced communities where access to medical examinations is lacking, DNA evidence is not available to support prosecutions and perpetrators are rarely identified and held accountable for crimes of sexual violence. This paper reports the results of a proof-of-concept study testing the efficacy of a novel self-examination intimate swab designed for recovering DNA following unprotected sexual intercourse. The results of this study corroborate previous research which has demonstrated that male DNA profiles can be successfully recovered by post-coital, self-examination methods, and discusses how this novel approach could enable the integration of DNA evidence into victim-centred approaches to investigating and prosecuting sexual violence in low-resource environments. The results and discussion challenge the prevailing assumption that intimate DNA swabs must be collected by trained medical professionals in order to be of evidential value.
Assuntos
DNA/isolamento & purificação , Estupro , Autoexame , Manejo de Espécimes/instrumentação , Cromossomos Humanos Y , Impressões Digitais de DNA , Países em Desenvolvimento , Feminino , Humanos , Masculino , Repetições de Microssatélites , Reação em Cadeia da PolimeraseRESUMO
Many studies of human populations have used the male-specific region of the Y chromosome (MSY) as a marker, but MSY sequence variants have traditionally been subject to ascertainment bias. Also, dating of haplogroups has relied on Y-specific short tandem repeats (STRs), involving problems of mutation rate choice, and possible long-term mutation saturation. Next-generation sequencing can ascertain single nucleotide polymorphisms (SNPs) in an unbiased way, leading to phylogenies in which branch-lengths are proportional to time, and allowing the times-to-most-recent-common-ancestor (TMRCAs) of nodes to be estimated directly. Here we describe the sequencing of 3.7 Mb of MSY in each of 448 human males at a mean coverage of 51×, yielding 13,261 high-confidence SNPs, 65.9% of which are previously unreported. The resulting phylogeny covers the majority of the known clades, provides date estimates of nodes, and constitutes a robust evolutionary framework for analyzing the history of other classes of mutation. Different clades within the tree show subtle but significant differences in branch lengths to the root. We also apply a set of 23 Y-STRs to the same samples, allowing SNP- and STR-based diversity and TMRCA estimates to be systematically compared. Ongoing purifying selection is suggested by our analysis of the phylogenetic distribution of nonsynonymous variants in 15 MSY single-copy genes.
Assuntos
Cromossomos Humanos Y/genética , Polimorfismo de Nucleotídeo Único/genética , Evolução Molecular , Projeto HapMap , Humanos , Masculino , Filogenia , Análise de Sequência de DNARESUMO
BACKGROUND: Great apes are a global conservation concern, with anthropogenic pressures threatening their survival. Genetic analysis can be used to assess the effects of reduced population sizes and the effectiveness of conservation measures. In humans, autosomal short tandem repeats (aSTRs) are widely used in population genetics and for forensic individual identification and kinship testing. Traditionally, genotyping is length-based via capillary electrophoresis (CE), but there is an increasing move to direct analysis by massively parallel sequencing (MPS). An example is the ForenSeq DNA Signature Prep Kit, which amplifies multiple loci including 27 aSTRs, prior to sequencing via Illumina technology. Here we assess the applicability of this human-based kit in African great apes. We ask whether cross-species genotyping of the orthologs of these loci can provide both individual and (sub)species identification. RESULTS: The ForenSeq kit was used to amplify and sequence aSTRs in 52 individuals (14 chimpanzees; 4 bonobos; 16 western lowland, 6 eastern lowland, and 12 mountain gorillas). The orthologs of 24/27 human aSTRs amplified across species, and a core set of thirteen loci could be genotyped in all individuals. Genotypes were individually and (sub)species identifying. Both allelic diversity and the power to discriminate (sub)species were greater when considering STR sequences rather than allele lengths. Comparing human and African great-ape STR sequences with an orangutan outgroup showed general conservation of repeat types and allele size ranges. Variation in repeat array structures and a weak relationship with the known phylogeny suggests stochastic origins of mutations giving rise to diverse imperfect repeat arrays. Interruptions within long repeat arrays in African great apes do not appear to reduce allelic diversity. CONCLUSIONS: Orthologs of most human aSTRs in the ForenSeq DNA Signature Prep Kit can be analysed in African great apes. Primer redesign would reduce observed variability in amplification across some loci. MPS of the orthologs of human loci provides better resolution for both individual and (sub)species identification in great apes than standard CE-based approaches, and has the further advantage that there is no need to limit the number and size ranges of analysed loci.
Assuntos
Hominidae , Repetições de Microssatélites , Animais , Repetições de Microssatélites/genética , Hominidae/genética , Humanos , Pan paniscus/genética , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA/métodos , Gorilla gorilla/genética , Genótipo , Genética Forense/métodosRESUMO
Hair shed by domestic cats is a potentially useful source of forensic evidence. Analysable hair DNA is predominantly mitochondrial, but the recent domestication history of cats means that mtDNA diversity is low. A 402-bp control region segment is usually sequenced, defining only a small number of distinct haplotypes in populations. Previously, we used a long-amplicon approach to sequence whole mitogenomes in a sample of blood DNAs from 119 UK cats, greatly increasing observed diversity and reducing random match probabilities. To exploit this variation for forensic analysis, we here describe a multiplex system that amplifies the cat mitogenome in 60 overlapping amplicons of mean length 360 bp, followed by Nanopore sequencing. Variants detected in multiplex sequence data from unrooted hair completely mirror those from long-amplicon data from blood from the same individuals. However, applying the multiplex to matched blood DNA reveals additional sequence variants which derive from the major feline nuclear mitochondrial insertion sequence (numt), which covers 7.9 kb of the 17-kb mitogenome and exists in multiple tandem copies. We use long-amplicon Nanopore sequencing to investigate numt variation in a set of cats, together with an analysis of published genome sequences, and show that numt arrays are variable in both structure and sequence, thus providing a potential source of uncertainty when nuclear DNA predominates in a sample. Forensic application of the multiplex was demonstrated by matching hairs from a cat with skeletal remains from its putative mother, both of which shared a globally common haplotype at the control region. The random match probability in this case with the CR 402-bp segment was 0.21 and this decreased to 0.03 when considering the whole mitogenome. The developed multiplex and sequencing approach, when applied to cat hair where nuclear DNA is scarce, can provide a reliable and highly discriminating source of forensic genetic evidence from a single hair. The confounding effect of numt co-amplification in degraded samples where mixed sequences are observed can be mitigated by variant phasing, and by comparison with numt sequence diversity data, such as those presented here.
Assuntos
Genoma Mitocondrial , Sequenciamento por Nanoporos , Animais , Gatos/genética , Humanos , DNA Mitocondrial/genética , Medicina Legal , Análise de Sequência de DNARESUMO
Short tandem repeat (STR) polymorphisms are traditionally assessed by measuring allele lengths via capillary electrophoresis (CE). Massively parallel sequencing (MPS) reveals differences among alleles of the same length, thus improving discrimination, but also identifying groups of alleles likely related by descent. These may have relatively restricted geographical distributions and thus MPS could detect population structure more effectively than CE-based analysis. We addressed this question by applying an MPS multiplex, the Promega PowerSeq™ Auto/Mito/Y System prototype, to 362 individuals chosen to represent a wide geographical spread from the People of the British Isles (PoBI) cohort, which represents at least three generations of local rural ancestry. As well as 22 autosomal STRs (aSTRs; equivalent to PowerPlex Fusion loci) the system sequences 23 Y-STRs (the PowerPlexY 23 loci) and the control region (CR) of mitochondrial DNA (mtDNA), allowing population structure to be compared across biparentally and uniparentally inherited segments of the genome. For all loci, FST-based tests of population structure were done based on historical, linguistic, and geographical partitions, and for aSTRs the clustering algorithm STRUCTURE was also applied. STRs were considered using both length and sequence. Sequencing increased aSTR allele diversity by 87.5% compared to CE-based designations, reducing random match probability to 1.25E-30, compared to a CE-based 6.72E-27. Significant population structure was detectable in just one pairwise comparison (Central/South East England compared to the rest), and for sequence-based alleles only. The 362 samples carried 308 distinct mtDNA CR haplotypes corresponding to 13 broad haplogroups, representing a haplotype diversity of 0.9985 ( ± 0.0005), and a haplotype match probability of 0.0043. No significant population structure was observed. Y-STR haplotypes belonged to ten broad predicted Y-haplogroups. Allele diversity increased by 33% when considered at the sequence rather than length level, although haplotype diversity was unchanged at 0.999969 ( ± 0.000001); haplotype match probability was 2.79E-03. In contrast to the biparentally and maternally inherited loci, Y-STR haplotypes showed significant population structure at several levels, but most markedly in a comparison of regions subject to Anglo-Saxon influence in the east with the rest of the sample. This was evident for both length- and sequence-based allele designations, with no systematic difference between the two. We conclude that MPS analysis of aSTRs or Y-STRs does not generally reveal stronger population structure than length-based analysis, that UK maternal lineages are not significantly structured, and that Y-STR haplotypes reveal significant population structure that may reflect the Anglo-Saxon migrations to Britain in the 6th century.
Assuntos
Cromossomos Humanos Y , Impressões Digitais de DNA , DNA Mitocondrial/genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Repetições de Microssatélites , Análise de Sequência de DNARESUMO
Birds of prey have suffered persecution for centuries through trapping, shooting, poisoning and theft from the wild to meet the demand from egg collectors and falconers; they were also amongst the earliest beneficiaries of DNA testing in wildlife forensics. Here we report the identification and characterisation of 14 novel tetramer, pentamer and hexamer short tandem repeat (STR) markers which can be typed either by capillary electrophoresis or massively parallel sequencing (MPS) and apply them to historical casework samples involving 49 peregrine falcons, 30 of which were claimed to be the captively bred offspring of nine pairs. The birds were initially tested in 1994 with a multilocus DNA fingerprinting probe, a sex test and eight single-locus minisatellite probes (SLPs) demonstrating that 23 birds were unrelated to the claimed parents. The multilocus and SLP approaches were highly discriminating but extremely time consuming and required microgram quantities of high molecular weight DNA and the use of radioisotopes. The STR markers displayed between 2 and 21 alleles per locus (mean = 7.6), lengths between 140 and 360 bp, and heterozygosities from 0.4 to 0.93. They produced wholly concordant conclusions with similar discrimination power but in a fraction of the time using a hundred-fold less DNA and with standard forensic equipment. Furthermore, eleven of these STRs were amplified in a single reaction and typed using MPS on the Illumina MiSeq platform revealing eight additional alleles (three with variant repeat structures and five solely due to flanking SNPs) across four loci. This approach gave a random match probability of < 1E-9, and a parental pair false inclusion probability of < 1E-5, with a further ten-fold reduction in the amount of DNA required (~3 ng) and the potential to analyse mixed samples. These STRs will be of value in monitoring wild populations of these key indicator species as well as for testing captive breeding claims and establishing a database of captive raptors. They have the potential to resolve complex cases involving trace, mixed and degraded samples from raptor persecution casework representing a significant advance over the previously applied methods.
Assuntos
Animais Selvagens , Repetições Minissatélites , Animais , Crime , DNA/genética , Impressões Digitais de DNA , Eletroforese Capilar , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNARESUMO
Kenya is a diverse and populous nation that employs DNA evidence in its criminal justice system, and therefore requires reliable information on autosomal STR allele frequency variation across the country and in its many ethnic groups. In order to provide reference data and to assess population structure, we analysed the 21 autosomal STRs in the GlobalFiler multiplex in a sample of 510 indigenous Kenyans representing the country's eight former provinces, 43 of its 47 counties, three main linguistic families and all 29 ethnic groups that each comprise >0.5% of the 2019 census population. The indigenous population originated from successive migrations of Cushitic, Nilotic and Bantu speaking groups who settled in regions that suited their distinctive sustenance lifestyles. Consequently, they now largely reside in a patchwork of communities with strong associations with particular counties and provinces and limited degrees of inter-group marriage, as shown by DNA donors' ancestry details. We found significant genetic differentiation between the three Nilotic language sub-families, with Western Nilotes (the Luo ethnic group) showing greater similarity to the Bantu than the Southern and Eastern Nilotes which themselves showed closer affinity to the Cushitic speakers. This concurs with previous genetic, linguistic and social studies. Comparisons with other African populations also showed that linguistic affiliation is a stronger factor than geography. This study revealed several rare off-ladder alleles whose structure was determined by Sanger sequencing. Among the unusual features that could affect profile interpretation were a deletion of Amelogenin Y but no other forensic marker (autosomal or Y-chromosomal), a triallelic pattern at TPOX and an extremely short SE33 allele falling within the expected size range of D7S820. Compared with the currently implemented Identifiler multiplex, Random Match Probabilities decreased from 6.4 × 10-19 to 3.9 × 10-27. The appreciation of local population structure provided by the geographically and ethnically representative sample in this study highlights the structured genetic landscape of Kenya.
Assuntos
Etnicidade/genética , Genética Populacional , Idioma , Repetições de Microssatélites , Filogeografia , DNA/genética , Frequência do Gene , Genótipo , Humanos , Quênia , Linguística , Masculino , Reação em Cadeia da Polimerase , Análise de Sequência de DNARESUMO
The influence of Viking-Age migrants to the British Isles is obvious in archaeological and place-names evidence, but their demographic impact has been unclear. Autosomal genetic analyses support Norse Viking contributions to parts of Britain, but show no signal corresponding to the Danelaw, the region under Scandinavian administrative control from the ninth to eleventh centuries. Y-chromosome haplogroup R1a1 has been considered as a possible marker for Viking migrations because of its high frequency in peninsular Scandinavia (Norway and Sweden). Here we select ten Y-SNPs to discriminate informatively among hg R1a1 sub-haplogroups in Europe, analyse these in 619 hg R1a1 Y chromosomes including 163 from the British Isles, and also type 23 short-tandem repeats (Y-STRs) to assess internal diversity. We find three specifically Western-European sub-haplogroups, two of which predominate in Norway and Sweden, and are also found in Britain; star-like features in the STR networks of these lineages indicate histories of expansion. We ask whether geographical distributions of hg R1a1 overall, and of the two sub-lineages in particular, correlate with regions of Scandinavian influence within Britain. Neither shows any frequency difference between regions that have higher (≥10%) or lower autosomal contributions from Norway and Sweden, but both are significantly overrepresented in the region corresponding to the Danelaw. These differences between autosomal and Y-chromosomal histories suggest either male-specific contribution, or the influence of patrilocality. Comparison of modern DNA with recently available ancient DNA data supports the interpretation that two sub-lineages of hg R1a1 spread with the Vikings from peninsular Scandinavia.
Assuntos
Cromossomos Humanos Y/genética , Haplótipos , Migração Humana , Evolução Molecular , Humanos , Masculino , Repetições Minissatélites , Linhagem , Polimorfismo de Nucleotídeo Único , Países Escandinavos e Nórdicos , Reino UnidoRESUMO
Massively parallel sequencing (MPS) of forensic STRs has the potential to reveal additional allele diversity compared to conventional capillary electrophoresis (CE) typing strategies, but population studies are currently relatively few in number. The Verogen ForenSeq™ DNA Signature Prep Kit includes both Y-STRs and X-STRs among its targeted loci, and here we report the sequences of these loci, analysed using Verogen's ForenSeq™ Universal Analysis Software (UAS) v1.3 and STRait Razor v3.0, in a representative sample of 89 Saudi Arabian males. We identified 56 length variants (equivalent to CE alleles) and 75 repeat sequence sub-variants across the six X-STRs analysed; equivalent figures for the set of 24 Y-STRs were 147 and 192 respectively. We also observed two flanking sequence variants for the X-, and six for the Y-STRs. Recovery of sequence data and concordance with CE data (where available) across the tested loci was good, though rare flanking variation affected interpretation and allele calling at DYF387S1 and DXS7132. Examination of flanking sequences of the Y-STRs revealed five SNPs (L255, M4790, BY7692, Z16708 and S17543) previously shown to define specific haplogroups by Y-chromosome sequencing. These define Y-haplogroups in 62 % of our sample, a proportion that increases to 91 % when haplogroup-associated repeat-sequence motifs are also considered. A population-level comparison of the Saudi Arabian X-STRs with a global sample showed our dataset to be part of a large cluster of populations of West Eurasian and Middle Eastern origin.
Assuntos
Cromossomos Humanos X , Cromossomos Humanos Y , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites , Impressões Digitais de DNA , Eletroforese Capilar , Humanos , Masculino , Polimorfismo de Nucleotídeo Único , Arábia Saudita , Análise de Sequência de DNARESUMO
The Y-STR DYS19 is firmly established in the repertoire of Y-chromosomal markers used in forensic analysis yet is poorly understood at the molecular level, lying in a complex genomic environment and exhibiting null alleles, as well as duplications and occasional triplications in population samples. Here, we analyse three null alleles and 51 duplications and show that DYS19 can also be involved in inversion events, so that even its location within the short arm of the Y chromosome is uncertain. Deletion mapping in the three chromosomes carrying null alleles shows that their deletions are less than approximately 300 kb in size. Haplotypic analysis with binary markers shows that they belong to three different haplogroups and so represent independent events. In contrast, a collection of 51 DYS19 duplication chromosomes belong to only four haplogroups: two are singletons and may represent somatic mutation in lymphoblastoid cell lines, but two, in haplogroups G and C3c, represent founder lineages that have spread widely in Central Europe/West Asia and East Asia, respectively. Consideration of candidate mechanisms underlying both deletions and duplications provides no evidence for the involvement of non-allelic homologous recombination, and they are likely to represent sporadic events with low mutation rates. Understanding the basis and population distribution of these DYS19 alleles will aid in the utilisation and interpretation of profiles that contain them.
Assuntos
Deleção Cromossômica , Inversão Cromossômica , Cromossomos Humanos Y , Duplicação Gênica , Sequências de Repetição em Tandem , Mapeamento Cromossômico , Eletroforese em Gel de Ágar , Genética Forense , Marcadores Genéticos , Haplótipos , Humanos , Masculino , Reação em Cadeia da PolimeraseRESUMO
Sequence analysis of the mitochondrial DNA (mtDNA) control region can provide forensically useful information, particularly in challenging samples where autosomal DNA profiling fails. Sub-division of the 1122-bp region into shorter PCR fragments improves data recovery, and such fragments can be analysed together via massively parallel sequencing (MPS). Here, we generate mtDNA data using the prototype PowerSeq™ Auto/Mito/Y System (Promega) MPS assay, in which a single PCR reaction amplifies ten overlapping amplicons of the control region, in a set of 101 highly diverse samples representing most major clades of the mtDNA phylogeny. The overlapping multiplex design leads to non-uniform coverage in the regions of overlap, where it is further increased by short amplicons generated alongside the intended products. Primer sequences in targeted amplification libraries are a potential source of reference sequence bias and thus should be removed, but the proprietary nature of the primers in commercial kits necessitates an alternative approach that minimises data loss: here, we introduce the bioinformatic selection of sequencing reads spanning putative primer sites (Overarching Read Enrichment Option, OREO). While OREO performs well in mitigating the effects of primer sequences at the ends of sequence reads, we still find evidence of the internalisation of primer-derived sequences by overlap extension, which may compromise the ability to call variants or to measure heteroplasmy in primer-binding regions. The commercially available PowerSeq™ CRM Nested System design prevents primer internalisation, as shown in a reanalysis of a subset of 57 samples that contain possible heteroplasmies. In combination with OREO, the CRM Nested kit mitigates reference sequence bias, allowing heteroplasmic variants to be estimated down to a 5% threshold. Provided appropriate steps are taken in data processing, single-reaction multiplex assays represent robust tools to analyse mtDNA control region variation. The OREO approach will allow users to bypass the effects of unknown primer sequences in any single-reaction tiled multiplex and eliminate primer-derived bias in overlapping amplicon sequencing studies, in both forensic and non-forensic settings.
Assuntos
DNA Mitocondrial/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA , Impressões Digitais de DNA , Humanos , Filogenia , Reação em Cadeia da Polimerase , Polimorfismo de Nucleotídeo ÚnicoRESUMO
The increasing use of non-laboratory-based DNA and protein detection methods promise to provide rapid investigative intelligence and support sample prioritisation. Primarily developed for human forensic or medical applications, current systems may also show utility in the field of wildlife forensic science. However, it is currently unknown whether the requirements of the wildlife forensic community can be met by current non-laboratory based tools. Given the diverse array of stakeholders and sample types commonly encountered, it is necessary to first identify the needs of the community and then try and map their needs to current instrumentation. By using a market research style questionnaire, this study identified key requirements for a non-laboratory-based system following feedback from the wildlife forensic community. Data showed that there is strong support for field-based detection methods while highlighting concerns including contamination risks and reduced quality assurance associated with non-laboratory testing. Key species and applications were identified alongside hurdles to implementation and adoption. Broadly, the requirements align with many of the developmental drivers that have led to the rise of in-field portable detection instrumentation, specifically rapid detection within one hour, ease-of-use, and ≥95% accuracy. Several existing platforms exist that met some of the identified requirements but not all. With further collaboration between industry partners and the wildlife forensic community it is possible that new field-based systems can be developed and applied routinely.
Assuntos
Animais Selvagens/genética , Conservação dos Recursos Naturais , Impressões Digitais de DNA/instrumentação , Retroalimentação , Avaliação das Necessidades , Inquéritos e Questionários , Animais , Comércio , Crime , Espécies em Perigo de Extinção , Humanos , Competência Profissional , Especificidade da EspécieRESUMO
Variation in the 21 autosomal STRs detected by the GlobalFiler multiplex was investigated in a sample of 523 indigenous male Arabs from five geographic regions of Saudi Arabia. Although allele frequencies for the entire dataset were found to be broadly similar to those determined in previous studies of Saudi citizens, significant differences were found among regions. Heterozygote deficiency was observed at nearly all loci in all regions, probably as a consequence of high levels of consanguineous marriage; in the case of D2S1338, which showed the largest deviation from Hardy-Weinberg equilibrium, the presence of a null allele also played a part. Genetic distances were greatest between the Northern and Southern regions, whilst the West, Central and East appeared most similar to each other, and to previously published surveys. This contrasts with previously described variation among paternal lineages in the same sample-set: Y-chromosome variation was limited within the North/Central/South core compared with the more diverse East and West. Differences between autosomal and Y-chromosomal patterns may reflect genetic drift on the Y chromosome, exacerbated by prevalent patrilineal descent groups in different regions.
Assuntos
Consanguinidade , Genética Populacional , Repetições de Microssatélites , Cromossomos Humanos Y , Impressões Digitais de DNA , Variação Genética , Heterozigoto , Humanos , Masculino , Reação em Cadeia da Polimerase , Arábia SauditaRESUMO
While many studies have been undertaken of Middle Eastern populations using autosomal STR profiling by capillary electrophoresis, little has so far been published from this region on the forensic use of massively parallel sequencing (MPS). Here, we carried out MPS of 27 autosomal STRs and 91 identity-informative SNPs (iiSNPs) with the Verogen ForenSeq™ DNA Signature Prep Kit on a representative sample of 89 Saudi Arabian males, and analysed the resulting sequence data using Verogen's ForenSeq Universal Analysis Software (UAS) v1.3 and STRait Razor v3.0. This revealed sequence variation in the composition of complex STR arrays, and SNPs in their flanking regions, which raised the number of STR alleles from 238 distinct length variants to 357 sequence sub-variants. Similarly, between one and three additional polymorphic sites were observed within the amplicons of 37 of the 91 iiSNPs, forming up to six microhaplotypes per locus. These further enhance discrimination compared to the biallelic target SNP data presented by the primary UAS interface. In total, we observed twenty-two STR alleles previously unrecognised in the STRait Razor v3.0 default allele list, along with nine SNPs flanking target iiSNPs that were not highlighted by the UAS. Sequencing reduced the STR-based random match probability (RMP) from 2.62E-30 to 3.49E-34, and analysis of the iiSNP microhaplotypes reduced RMP from 9.97E-37 to 6.83E-40. The lack of significant linkage disequilibrium between STRs and target iiSNPs allowed the two marker types to be combined using the product rule, yielding a RMP of 2.39E-73. Evidence of consanguinity was apparent from both marker types. While TPOX was the only locus displaying a significant deviation from Hardy-Weinberg equilibrium, 23 out of 27 STRs and 63 out of 91 iiSNPs showed fewer than expected heterozygotes, demonstrating an overall homozygote excess probably reflecting the high frequency of first-cousin marriages in Saudi Arabia. We placed our data in a global context by considering the same markers in the Human Genome Diversity Panel (HGDP), revealing that the Saudi sample was typical of Middle Eastern populations, with a higher level of inbreeding than is seen in most European, African and Central/South Asian populations, correlating with known patterns of endogamy. Given reduced levels of diversity within endogamous groups, the ability to combine the discrimination power of both STRs and SNPs offers significant benefits in the analysis of forensic evidence in Saudi Arabia and the Middle East region more generally.
Assuntos
Consanguinidade , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único , Alelos , Feminino , Humanos , Masculino , Arábia Saudita , Análise de Sequência de DNARESUMO
Short tandem repeats on the male-specific region of the Y chromosome (Y-STRs) are permanently linked as haplotypes, and therefore Y-STR sequence diversity can be considered within the robust framework of a phylogeny of haplogroups defined by single nucleotide polymorphisms (SNPs). Here we use massively parallel sequencing (MPS) to analyse the 23 Y-STRs in Promega's prototype PowerSeq™ Auto/Mito/Y System kit (containing the markers of the PowerPlex® Y23 [PPY23] System) in a set of 100 diverse Y chromosomes whose phylogenetic relationships are known from previous megabase-scale resequencing. Including allele duplications and alleles resulting from likely somatic mutation, we characterised 2311 alleles, demonstrating 99.83% concordance with capillary electrophoresis (CE) data on the same sample set. The set contains 267 distinct sequence-based alleles (an increase of 58% compared to the 169 detectable by CE), including 60 novel Y-STR variants phased with their flanking sequences which have not been reported previously to our knowledge. Variation includes 46 distinct alleles containing non-reference variants of SNPs/indels in both repeat and flanking regions, and 145 distinct alleles containing repeat pattern variants (RPV). For DYS385a,b, DYS481 and DYS390 we observed repeat count variation in short flanking segments previously considered invariable, and suggest new MPS-based structural designations based on these. We considered the observed variation in the context of the Y phylogeny: several specific haplogroup associations were observed for SNPs and indels, reflecting the low mutation rates of such variant types; however, RPVs showed less phylogenetic coherence and more recurrence, reflecting their relatively high mutation rates. In conclusion, our study reveals considerable additional diversity at the Y-STRs of the PPY23 set via MPS analysis, demonstrates high concordance with CE data, facilitates nomenclature standardisation, and places Y-STR sequence variants in their phylogenetic context.
Assuntos
Cromossomos Humanos Y , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites , Impressões Digitais de DNA , Variação Genética , Humanos , Masculino , Filogenia , Reação em Cadeia da PolimeraseRESUMO
Saudi Arabia's indigenous population is organized into patrilineal descent groups, but to date, little has been done to characterize its population structure, in particular with respect to the male-specific region of the Y chromosome. We have used the 27-STR Yfiler® Plus kit to generate haplotypes in 597 unrelated Saudi males, classified into five geographical regions (North, South, Central, East and West). Overall, Yfiler® Plus provides a good discrimination capacity of 95.3%, but this is greatly reduced (74.7%) when considering the reduced Yfiler® set of 17 Y-STRs, justifying the use of the expanded set of markers in this population. Comparison of the five geographical divisions reveals striking differences, with low diversity and similar haplotype spectra in the Central and Northern regions, and high diversity and similar haplotype spectra in the East and West. These patterns likely reflect the geographical isolation of the desert heartland of the peninsula, and the proximity to the sea of the Eastern and Western areas, and consequent historical immigration. We predicted haplogroups from Y-STR haplotypes, testing the performance of prediction by using a large independent set of Saudi Arabian Y-STRâ¯+â¯Y-SNP data. Prediction indicated predominance (71%) of haplogroup J1, which was significantly more common in Central, Northern and Southern groups than in East and West, and formed a star-like expansion cluster in a median-joining network with an estimated age of â¼2800 years. Most of our 597 participants were sampled within Saudi Arabia itself, but â¼16% were sampled in the UK. Despite matching these two groups by home sub-region, we observed significant differences in haplotype and predicted haplogroup constitutions overall, and for most sub-regions individually. This suggests social structure influencing the probability of leaving Saudi Arabia, correlated with different Y-chromosome compositions. The UK-recruited sample is an inappropriate proxy for Saudi Arabia generally, and caution is needed when considering expatriate groups as representative of country of origin. Our study shows the importance of geographical and social structuring that may affect the utility of forensic databases and the interpretation of Y-STR profiles.
Assuntos
Cromossomos Humanos Y , Genética Populacional , Repetições de Microssatélites , Impressões Digitais de DNA , Haplótipos , Humanos , Masculino , Polimorfismo de Nucleotídeo Único , Arábia SauditaRESUMO
DNA variation in 402bp of the mitochondrial control region flanked by repeat sequences RS2 and RS3 was evaluated by Sanger sequencing in 152 English domestic cats, in order to determine the significance of matching DNA sequences between hairs found with a victim's body and the suspect's pet cat. Whilst 95% of English cats possessed one of the twelve globally widespread mitotypes, four new variants were observed, the most common of which (2% frequency) was shared with the evidential samples. No significant difference in mitotype frequency was seen between 32 individuals from the locality of the crime and 120 additional cats from the rest of England, suggesting a lack of local population structure. However, significant differences were observed in comparison with frequencies in other countries, including the closely neighbouring Netherlands, highlighting the importance of appropriate genetic databases when determining the evidential significance of mitochondrial DNA evidence.