Your browser doesn't support javascript.
loading
Coarse Graining of Data via Inhomogeneous Diffusion Condensation.
Brugnone, Nathan; Gonopolskiy, Alex; Moyle, Mark W; Kuchroo, Manik; van Dijk, David; Moon, Kevin R; Colon-Ramos, Daniel; Wolf, Guy; Hirn, Matthew J; Krishnaswamy, Smita.
Affiliation
  • Brugnone N; Dept. of Comp. Math., Sci. & Eng., Michigan State University, East Lansing, MI, USA.
  • Gonopolskiy A; PicnicHealth, Berlin, Germany.
  • Moyle MW; Dept. of Neuroscience, Yale University, New Haven, CT, USA.
  • Kuchroo M; Interdept. Neurosci. Prog., Yale University, New Haven, CT, USA.
  • van Dijk D; Dept. of Internal Medicine, Dept. of Computer Science, Yale University, New Haven, CT, USA.
  • Moon KR; Dept. of Math. and Stat., Utah State University, Logan, UT, USA.
  • Colon-Ramos D; Dept. of Neuroscience, Yale University, New Haven, CT, USA.
  • Wolf G; Dept. of Math. and Stat., Univ. de Montréal; Mila, Montreal, QC, Canada.
  • Hirn MJ; Dept. of Comp. Math., Sci. & Eng., Dept. of Mathematics, Michigan State University, East Lansing, MI, USA.
  • Krishnaswamy S; Dept. of Genetics, Dept. of Computer Science, Yale University, New Haven, CT, USA.
Proc IEEE Int Conf Big Data ; 2019: 2624-2633, 2019 Dec.
Article in En | MEDLINE | ID: mdl-32747879
Big data often has emergent structure that exists at multiple levels of abstraction, which are useful for characterizing complex interactions and dynamics of the observations. Here, we consider multiple levels of abstraction via a multiresolution geometry of data points at different granularities. To construct this geometry we define a time-inhomogemeous diffusion process that effectively condenses data points together to uncover nested groupings at larger and larger granularities. This inhomogeneous process creates a deep cascade of intrinsic low pass filters on the data affinity graph that are applied in sequence to gradually eliminate local variability while adjusting the learned data geometry to increasingly coarser resolutions. We provide visualizations to exhibit our method as a "continuously-hierarchical" clustering with directions of eliminated variation highlighted at each step. The utility of our algorithm is demonstrated via neuronal data condensation, where the constructed multiresolution data geometry uncovers the organization, grouping, and connectivity between neurons.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Proc IEEE Int Conf Big Data Year: 2019 Document type: Article Affiliation country: United States Country of publication: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Proc IEEE Int Conf Big Data Year: 2019 Document type: Article Affiliation country: United States Country of publication: United States