Your browser doesn't support javascript.
loading
SPLASH: A statistical, reference-free genomic algorithm unifies biological discovery.
Chaung, Kaitlin; Baharav, Tavor Z; Henderson, George; Zheludev, Ivan N; Wang, Peter L; Salzman, Julia.
Afiliação
  • Chaung K; Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA; Department of Biochemistry, Stanford University, Stanford, CA 94305, USA.
  • Baharav TZ; Department of Electrical Engineering, Stanford University, Stanford, CA 94305, USA.
  • Henderson G; Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA; Department of Biochemistry, Stanford University, Stanford, CA 94305, USA.
  • Zheludev IN; Department of Biochemistry, Stanford University, Stanford, CA 94305, USA.
  • Wang PL; Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA; Department of Biochemistry, Stanford University, Stanford, CA 94305, USA.
  • Salzman J; Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA; Department of Biochemistry, Stanford University, Stanford, CA 94305, USA; Department of Statistics (by courtesy), Stanford University, Stanford, CA 94305, USA; Department of Biology (by courtesy), Stanford Universit
Cell ; 186(25): 5440-5456.e26, 2023 12 07.
Article em En | MEDLINE | ID: mdl-38065078
ABSTRACT
Today's genomics workflows typically require alignment to a reference sequence, which limits discovery. We introduce a unifying paradigm, SPLASH (Statistically Primary aLignment Agnostic Sequence Homing), which directly analyzes raw sequencing data, using a statistical test to detect a signature of regulation sample-specific sequence variation. SPLASH detects many types of variation and can be efficiently run at scale. We show that SPLASH identifies complex mutation patterns in SARS-CoV-2, discovers regulated RNA isoforms at the single-cell level, detects the vast sequence diversity of adaptive immune receptors, and uncovers biology in non-model organisms undocumented in their reference genomes geographic and seasonal variation and diatom association in eelgrass, an oceanic plant impacted by climate change, and tissue-specific transcripts in octopus. SPLASH is a unifying approach to genomic analysis that enables expansive discovery without metadata or references.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Genômica Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Genômica Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article