RESUMO
BACKGROUND: Short tandem repeats (STRs) are found in many prokaryotic and eukaryotic genomes, and are commonly used as genetic markers, in particular for identity and parental testing in DNA forensics. The unstable expansion of some STRs was associated with various genetic disorders (e.g., the Huntington disease), and thus was used in genetic testing for screening individuals at high risk. Traditional STR analyses were based on the PCR amplification of STR loci followed by gel electrophoresis. With the availability of massive whole genome sequencing data, it becomes practical to mine STR profiles in silico from genome sequences. Software tools such as lobSTR and STR-FM have been developed to address these demands, which are, however, built upon whole genome reads mapping tools, and thus may not be sensitive enough. RESULTS: In this paper, we present a standalone software tool STRScan that uses a greedy algorithm for targeted STR profiling in next-generation sequencing (NGS) data. STRScan was tested on the whole genome sequencing data from Venter genome sequencing and 1000 Genomes Project. The results showed that STRScan can profile 20% more STRs in the target set that are missed by lobSTR. CONCLUSION: STRScan is particularly useful for the NGS-based targeted STR profiling, e.g., in genetic and human identity testing. STRScan is available as open-source software at http://darwin.informatics.indiana.edu/str/ .
Assuntos
Bases de Dados de Ácidos Nucleicos , Repetições de Microssatélites/genética , Software , Sequenciamento Completo do Genoma , Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala , HumanosRESUMO
Mass migration and migrant death at the U.S. southern border highlight the disconnectedness of the systems for transnational decedent identifications. Death investigation cases in Texas face delays and barriers at all stages of an investigation. Additionally, fragmentation of DNA databases exacerbate challenges in comparing genetic samples from unidentified human remains (UHR) and families of the missing. We sought to pilot alternative workflows for processing UHR and family reference samples (FRS) for the identification of probable migrant decedents. Primarily using Rapid DNA, but also accredited non-CODIS DNA laboratories, the piloted approaches were conducted in parallel to existing medicolegal workflows under the relevant case jurisdictional guidance. Our data show that Rapid DNA is a valid path for anthropology laboratories to support identification hypotheses and that accredited non-CODIS forensic and genetic laboratories also can support families to identify remains, especially when families reside outside of the United States.
RESUMO
The Bull Shark (Carcharhinus leucas) faces varying levels of exploitation around the world due to its coastal distribution. Information regarding population connectivity is crucial to evaluate its conservation status and local fishing impacts. In this study, we sampled 922 putative Bull Sharks from 19 locations in the first global assessment of population structure of this cosmopolitan species. Using a recently developed DNA-capture approach (DArTcap), samples were genotyped for 3400 nuclear markers. Additionally, full mitochondrial genomes of 384 Indo-Pacific samples were sequenced. Reproductive isolation was found between and across ocean basins (eastern Pacific, western Atlantic, eastern Atlantic, Indo-West Pacific) with distinct island populations in Japan and Fiji. Bull Sharks appear to maintain gene flow using shallow coastal waters as dispersal corridors, whereas large oceanic distances and historical land-bridges act as barriers. Females tend to return to the same area for reproduction, making them more susceptible to local threats and an important focus for management actions. Given these behaviors, the exploitation of Bull Sharks from insular populations, such as Japan and Fiji, may instigate local decline that cannot readily be replenished by immigration, which can in turn affect ecosystem dynamics and functions. These data also supported the development of a genetic panel to ascertain the population of origin, which will be useful in monitoring the trade of fisheries products and assessing population-level impacts of this harvest.
RESUMO
The widely recounted story of the origin of cultivated strawberry (Fragaria × ananassa) oversimplifies the complex interspecific hybrid ancestry of the highly admixed populations from which heirloom and modern cultivars have emerged. To develop deeper insights into the three-century-long domestication history of strawberry, we reconstructed the genealogy as deeply as possible-pedigree records were assembled for 8,851 individuals, including 2,656 cultivars developed since 1775. The parents of individuals with unverified or missing pedigree records were accurately identified by applying an exclusion analysis to array-genotyped single-nucleotide polymorphisms. We identified 187 wild octoploid and 1,171 F. × ananassa founders in the genealogy, from the earliest hybrids to modern cultivars. The pedigree networks for cultivated strawberry are exceedingly complex labyrinths of ancestral interconnections formed by diverse hybrid ancestry, directional selection, migration, admixture, bottlenecks, overlapping generations, and recurrent hybridization with common ancestors that have unequally contributed allelic diversity to heirloom and modern cultivars. Fifteen to 333 ancestors were predicted to have transmitted 90% of the alleles found in country-, region-, and continent-specific populations. Using parent-offspring edges in the global pedigree network, we found that selection cycle lengths over the past 200 years of breeding have been extraordinarily long (16.0-16.9 years/generation), but decreased to a present-day range of 6.0-10.0 years/generation. Our analyses uncovered conspicuous differences in the ancestry and structure of North American and European populations, and shed light on forces that have shaped phenotypic diversity in F. × ananassa.
Assuntos
Domesticação , Fragaria , Fragaria/genética , Hibridização Genética , Melhoramento VegetalRESUMO
Birds of prey have suffered persecution for centuries through trapping, shooting, poisoning and theft from the wild to meet the demand from egg collectors and falconers; they were also amongst the earliest beneficiaries of DNA testing in wildlife forensics. Here we report the identification and characterisation of 14 novel tetramer, pentamer and hexamer short tandem repeat (STR) markers which can be typed either by capillary electrophoresis or massively parallel sequencing (MPS) and apply them to historical casework samples involving 49 peregrine falcons, 30 of which were claimed to be the captively bred offspring of nine pairs. The birds were initially tested in 1994 with a multilocus DNA fingerprinting probe, a sex test and eight single-locus minisatellite probes (SLPs) demonstrating that 23 birds were unrelated to the claimed parents. The multilocus and SLP approaches were highly discriminating but extremely time consuming and required microgram quantities of high molecular weight DNA and the use of radioisotopes. The STR markers displayed between 2 and 21 alleles per locus (mean = 7.6), lengths between 140 and 360 bp, and heterozygosities from 0.4 to 0.93. They produced wholly concordant conclusions with similar discrimination power but in a fraction of the time using a hundred-fold less DNA and with standard forensic equipment. Furthermore, eleven of these STRs were amplified in a single reaction and typed using MPS on the Illumina MiSeq platform revealing eight additional alleles (three with variant repeat structures and five solely due to flanking SNPs) across four loci. This approach gave a random match probability of < 1E-9, and a parental pair false inclusion probability of < 1E-5, with a further ten-fold reduction in the amount of DNA required (~3 ng) and the potential to analyse mixed samples. These STRs will be of value in monitoring wild populations of these key indicator species as well as for testing captive breeding claims and establishing a database of captive raptors. They have the potential to resolve complex cases involving trace, mixed and degraded samples from raptor persecution casework representing a significant advance over the previously applied methods.
Assuntos
Animais Selvagens , Repetições Minissatélites , Animais , Crime , DNA/genética , Impressões Digitais de DNA , Eletroforese Capilar , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNARESUMO
Wet-lab based studies have exploited emerging single-cell technologies to address the challenges of interpreting forensic mixture evidence. However, little effort has been dedicated to developing a systematic approach to interpreting the single-cell profiles derived from the mixtures. This study is the first attempt to develop a comprehensive interpretation workflow in which single-cell profiles from mixtures are interpreted individually and holistically. In this approach, the genotypes from each cell are assessed, the number of contributors (NOC) of the single-cell profiles is estimated, followed by developing a consensus profile of each contributor, and finally the consensus profile(s) can be used for a DNA database search or comparing with known profiles to determine their potential sources. The potential of this single-cell interpretation workflow was assessed by simulation with various mixture scenarios and empirical allele drop-out and drop-in rates, the accuracies of estimating the NOC, the accuracies of recovering the true alleles by consensus, and the capabilities of deconvolving mixtures with related contributors. The results support that the single-cell based mixture interpretation can provide a precision that cannot beachieved with current standard CE-STR analyses. A new paradigm for mixture interpretation is available to enhance the interpretation of forensic genetic casework.
Assuntos
DNA/análise , Genética Forense , Análise de Célula Única/métodos , Algoritmos , Alelos , Análise por Conglomerados , DNA/química , DNA/genética , Contaminação por DNA , Impressões Digitais de DNA/métodos , Genética Forense/métodos , Genética Forense/tendências , Técnicas Genéticas , Genótipo , Humanos , Repetições de MicrossatélitesRESUMO
High-throughput sequencing (HTS) of large panels of single nucleotide polymorphisms (SNPs) provides an alternative or complimentary approach to short tandem repeats (STRs) panels for the analysis of complex DNA mixture forensic samples. For STRs, methods to estimate individual contribution concentrations compare capillary electrophoresis peak heights, peak areas, or HTS allele read counts within a mixture. This article introduces three approaches (mean, median, and slope methods) for estimating individual DNA contributions to forensic mixtures for HTS/massively parallel sequencing (MPS) SNP panels. For SNPs, the major:minor allele ratios or counts, unique to each contributor, were compared to estimate contributor proportion within the mixture using the mean, median, and slope intercept for these alleles. The estimates for these three methods were typically within 5% of planned experimental contributions for defined mixtures.
Assuntos
DNA/genética , Sequenciamento de Nucleotídeos em Larga Escala , Polimorfismo de Nucleotídeo Único , Alelos , Impressões Digitais de DNA , Genética Forense/métodos , Humanos , Modelos LinearesRESUMO
Elasmobranchs, the group of cartilaginous fishes that include sharks and rays, are especially vulnerable to overfishing due to low fecundity and late sexual maturation. A significant number of elasmobranch species are currently overexploited or threatened by fisheries activities. Additionally, several recent reports have indicated that there has been a reduction in regional elasmobranch population sizes. Brazil is an important player in elasmobranch fisheries and one of the largest importers of shark meat. However, carcasses entering the shark meat market have usually had their fins and head removed, which poses a challenge to reliable species identification based on the morphology of captured individuals. This is further complicated by the fact that the internal Brazilian market trades several different elasmobranch species under a common popular name: "cação." The use of such imprecise nomenclature, even among governmental agencies, is problematic for both controlling the negative effects of shark consumption and informing the consumer about the origins of the product. In this study, we used DNA barcoding (mtDNA, COI gene) to identify, at the species level, "cação" samples available in local markets from Southern Brazil. We collected 63 samples traded as "cação," which we found to correspond to 20 different species. These included two teleost species: Xiphias gladius (n = 1) and Genidens barbus (n = 6), and 18 species from seven elasmobranch orders (Carcharhiniformes, n = 42; Squaliformes, n = 3; Squatiniformes, n = 2; Rhinopristiformes, n = 4; Myliobatiformes, n = 3; Rajiformes, n = 1; and Torpediniformes, n = 1). The most common species in our sample were Prionace glauca (n = 15) and Sphyrna lewini (n = 14), while all other species were represented by four samples or less. Considering IUCN criteria, 47% of the elasmobranch species found are threatened at the global level, while 53% are threatened and 47% are critically endangered in Brazil. These results underline that labeling the meat of any shark species as "cação" is problematic for monitoring catch allocations from the fishing industry and discourages consumer engagement in conservationist practices through informed decision-making.
RESUMO
Certain articles of worship are commonly sold in Uttarakhand, India by the name Hatha Jodi, a root of a rare plant found only in a few parts of central India. The present work provides genetic proof that the Hatha Jodi sold in three local markets of Uttarakhand contained material from the Varanus species, species protected under the Indian Wildlife (Protection) Act, 1972. A total of eight samples were bought, two each from the local markets in Haridwar and Rishikesh, three from Dehradun and one from an online source (Amazon). The initial inspection confirmed that two of the samples were made of plastic material. Therefore only the other six samples were subjected to DNA analysis. DNA sequences were successfully obtained and matched with reference sequences available in NCBI Genbank database through BLAST search tool for species identification. All the six samples matched 100% with the Indian monitor lizard. The findings indicate how commercialization and the wildlife trade are playing a role in decline of the population of the Indian monitor lizard. If strong protection measures are not taken as soon as possible, the Indian monitor lizards will go Extinct very soon. Therefore, we suggest that the Government and Wildlife enforcement agencies take serious action against the illegal articles available in the local markets of Uttarakhand under the name Hatha Jodi. Further, the government needs to take legal action against offenders in other states in which the product is available for sale.
RESUMO
High-throughput sequencing (HTS) of single nucleotide polymorphisms (SNPs) enables additional DNA forensic capabilities not attainable using traditional STR panels. However, the inclusion of sets of loci selected for mixture analysis, extended kinship, phenotype, biogeographic ancestry prediction, etc., can result in large panel sizes that are difficult to analyze in a rapid fashion. GrigoraSNP was developed to address the allele-calling bottleneck that was encountered when analyzing SNP panels with more than 5000 loci using HTS. GrigoraSNPs uses a MapReduce parallel data processing on multiple computational threads plus a novel locus-identification hashing strategy leveraging target sequence tags. This tool optimizes the SNP calling module of the DNA analysis pipeline with runtimes that scale linearly with the number of HTS reads. Results are compared with SNP analysis pipelines implemented with SAMtools and GATK. GrigoraSNPs removes a computational bottleneck for processing forensic samples with large HTS SNP panels.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA/métodos , Software , Alelos , Impressões Digitais de DNA/métodos , Humanos , Reação em Cadeia da PolimeraseRESUMO
Patrilineally transmitted Y-chromosomal markers have been shown to resolve forensic cases under certain scenarios where autosomal markers provide limited or inconclusive evidence. Several Y-short tandem repeat (Y-STR) loci have been validated for forensic use, and Y-STR haplotype databases for a number of geographic locations are now available in the literature and online. In this review, examples are presented of situations where Y-STR loci can provide valuable supplemental forensic evidence when autosomal STR loci fail or provide little evidence. Also, different methods of interpreting Y-STR forensic evidence in casework analysis are outlined, suggesting that in spite of excessive conservativeness, the counting method is still the most simple and easily defensible method of interpreting Y-STR forensic evidence. While the need for developing more powerful interpretation methods should be considered, it is stressed that expansion of Y-STR databases should focus on increasing sample sizes and the inclusion of more anthropologically defined populations to improve the efficiency of interpretation of Y-STR markers in forensic applications.