Ultrasensitive sequencing of STR markers utilizing unique molecular identifiers and the SiMSen-Seq method.
Forensic Sci Int Genet
; 71: 103047, 2024 Jul.
Article
en En
| MEDLINE
| ID: mdl-38598919
ABSTRACT
Massively parallel sequencing (MPS) is increasingly applied in forensic short tandem repeat (STR) analysis. The presence of stutter artefacts and other PCR or sequencing errors in the MPS-STR data partly limits the detection of low DNA amounts, e.g., in complex mixtures. Unique molecular identifiers (UMIs) have been applied in several scientific fields to reduce noise in sequencing. UMIs consist of a stretch of random nucleotides, a unique barcode for each starting DNA molecule, that is incorporated in the DNA template using either ligation or PCR. The barcode is used to generate consensus reads, thus removing errors. The SiMSen-Seq (Simple, multiplexed, PCR-based barcoding of DNA for sensitive mutation detection using sequencing) method relies on PCR-based introduction of UMIs and includes a sophisticated hairpin design to reduce unspecific primer binding as well as PCR protocol adjustments to further optimize the reaction. In this study, SiMSen-Seq is applied to develop a proof-of-concept seven STR multiplex for MPS library preparation and an associated bioinformatics pipeline. Additionally, machine learning (ML) models were evaluated to further improve UMI allele calling. Overall, the seven STR multiplex resulted in complete detection and concordant alleles for 47 single-source samples at 1â¯ng input DNA as well as for low-template samples at 62.5â¯pg input DNA. For twelve challenging mixtures with minor contributions of 10â¯pg to 150â¯pg and ratios of 1-15% relative to the major donor, 99.2% of the expected alleles were detected by applying the UMIs in combination with an ML filter. The main impact of UMIs was a substantially lowered number of artefacts as well as reduced stutter ratios, which were generally below 5% of the parental allele. In conclusion, UMI-based STR sequencing opens new means for improved analysis of challenging crime scene samples including complex mixtures.
Palabras clave
Texto completo:
1
Banco de datos:
MEDLINE
Asunto principal:
Dermatoglifia del ADN
/
Repeticiones de Microsatélite
/
Secuenciación de Nucleótidos de Alto Rendimiento
Límite:
Humans
Idioma:
En
Revista:
Forensic Sci Int Genet
Asunto de la revista:
GENETICA
/
JURISPRUDENCIA
Año:
2024
Tipo del documento:
Article
País de afiliación:
Suecia