RESUMEN
Efficient computation of optimal transport distance between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state-of-the-art for such computations, but require On2 computations. In addition, Sinkhorn-based methods commonly use an Euclidean ground distance between datapoints. However, with the prevalence of manifold structured scientific data, it is often desirable to consider geodesic ground distance. Here, we tackle both issues by proposing Geodesic Sinkhorn-based on diffusing a heat kernel on a manifold graph. Notably, Geodesic Sinkhorn requires only O(nlogâ¡n) computation, as we approximate the heat kernel with Chebyshev polynomials based on the sparse graph Laplacian. We apply our method to the computation of barycenters of several distributions of high dimensional single cell data from patient samples undergoing chemotherapy. In particular, we define the barycentric distance as the distance between two such barycenters. Using this definition, we identify an optimal transport distance and path associated with the effect of treatment on cellular data.
RESUMEN
Organoids are biomimetic tissue models comprising multiple cell types and cell states. Post-translational modification (PTM) signaling networks control cellular phenotypes and are frequently dysregulated in diseases such as cancer. Although signaling networks vary across cell types, there are limited techniques to study cell type-specific PTMs in heterocellular organoids. Here, we present a multiplexed mass cytometry (MC) protocol for single-cell analysis of PTM signaling and cell states in organoids and organoids co-cultured with fibroblasts and leukocytes. We describe how thiol-reactive organoid barcoding in situ (TOBis) enables 35-plex and 126-plex single-cell comparison of organoid cultures and provide a cytometry by time of flight (CyTOF) signaling analysis pipeline (CyGNAL) for computing cell type-specific PTM signaling networks. The TOBis MC protocol takes ~3 d from organoid fixation to data acquisition and can generate single-cell data for >40 antibodies from millions of cells across 126 organoid cultures in a single MC run.