Your browser doesn't support javascript.
loading
Effective gene expression prediction from sequence by integrating long-range interactions.
Avsec, Ziga; Agarwal, Vikram; Visentin, Daniel; Ledsam, Joseph R; Grabska-Barwinska, Agnieszka; Taylor, Kyle R; Assael, Yannis; Jumper, John; Kohli, Pushmeet; Kelley, David R.
Afiliación
  • Avsec Z; DeepMind, London, UK. avsec@google.com.
  • Agarwal V; Calico Life Sciences, South San Francisco, CA, USA.
  • Visentin D; DeepMind, London, UK.
  • Ledsam JR; DeepMind, London, UK.
  • Grabska-Barwinska A; Google, Tokyo, Japan.
  • Taylor KR; DeepMind, London, UK.
  • Assael Y; DeepMind, London, UK.
  • Jumper J; DeepMind, London, UK.
  • Kohli P; DeepMind, London, UK.
  • Kelley DR; DeepMind, London, UK. pushmeet@google.com.
Nat Methods ; 18(10): 1196-1203, 2021 10.
Article en En | MEDLINE | ID: mdl-34608324
ABSTRACT
How noncoding DNA determines gene expression in different cell types is a major unsolved problem, and critical downstream applications in human genetics depend on improved solutions. Here, we report substantially improved gene expression prediction accuracy from DNA sequences through the use of a deep learning architecture, called Enformer, that is able to integrate information from long-range interactions (up to 100 kb away) in the genome. This improvement yielded more accurate variant effect predictions on gene expression for both natural genetic variants and saturation mutagenesis measured by massively parallel reporter assays. Furthermore, Enformer learned to predict enhancer-promoter interactions directly from the DNA sequence competitively with methods that take direct experimental data as input. We expect that these advances will enable more effective fine-mapping of human disease associations and provide a framework to interpret cis-regulatory evolution.
Asunto(s)

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: ADN / Regulación de la Expresión Génica / Bases de Datos Genéticas / Epigénesis Genética / Aprendizaje Automático / Red Nerviosa Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Animals / Humans Idioma: En Revista: Nat Methods Asunto de la revista: TECNICAS E PROCEDIMENTOS DE LABORATORIO Año: 2021 Tipo del documento: Article País de afiliación: Reino Unido

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: ADN / Regulación de la Expresión Génica / Bases de Datos Genéticas / Epigénesis Genética / Aprendizaje Automático / Red Nerviosa Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Animals / Humans Idioma: En Revista: Nat Methods Asunto de la revista: TECNICAS E PROCEDIMENTOS DE LABORATORIO Año: 2021 Tipo del documento: Article País de afiliación: Reino Unido