Búsqueda | Portal de Búsqueda de la BVS

Optimal trade-off control in machine learning-based library design, with application to adeno-associated virus (AAV) for gene therapy.

Zhu, Danqing; Brookes, David H; Busia, Akosua; Carneiro, Ana; Fannjiang, Clara; Popova, Galina; Shin, David; Donohue, Kevin C; Lin, Li F; Miller, Zachary M; Williams, Evan R; Chang, Edward F; Nowakowski, Tomasz J; Listgarten, Jennifer; Schaffer, David V.

Sci Adv ; 10(4): eadj3786, 2024 Jan 26.

Artículo en Inglés | MEDLINE | ID: mdl-38266077

RESUMEN

Adeno-associated viruses (AAVs) hold tremendous promise as delivery vectors for gene therapies. AAVs have been successfully engineered-for instance, for more efficient and/or cell-specific delivery to numerous tissues-by creating large, diverse starting libraries and selecting for desired properties. However, these starting libraries often contain a high proportion of variants unable to assemble or package their genomes, a prerequisite for any gene delivery goal. Here, we present and showcase a machine learning (ML) method for designing AAV peptide insertion libraries that achieve fivefold higher packaging fitness than the standard NNK library with negligible reduction in diversity. To demonstrate our ML-designed library's utility for downstream engineering goals, we show that it yields approximately 10-fold more successful variants than the NNK library after selection for infection of human brain tissue, leading to a promising glial-specific variant. Moreover, our design approach can be applied to other types of libraries for AAV and beyond.

Asunto(s)

Dependovirus , Terapia Genética , Humanos , Dependovirus/genética , Biblioteca de Péptidos , Encéfalo , Aprendizaje Automático

MBE: model-based enrichment estimation and prediction for differential sequencing data.

Busia, Akosua; Listgarten, Jennifer.

Genome Biol ; 24(1): 218, 2023 Oct 02.

Artículo en Inglés | MEDLINE | ID: mdl-37784130

RESUMEN

Characterizing differences in sequences between two conditions, such as with and without drug exposure, using high-throughput sequencing data is a prevalent problem involving quantifying changes in sequence abundances, and predicting such differences for unobserved sequences. A key shortcoming of current approaches is their extremely limited ability to share information across related but non-identical reads. Consequently, they cannot use sequencing data effectively, nor be directly applied in many settings of interest. We introduce model-based enrichment (MBE) to overcome this shortcoming. We evaluate MBE using both simulated and real data. Overall, MBE improves accuracy compared to current differential analysis methods.

RESUMEN

Asunto(s)

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA