Publications | Dhananjay Bhaskar

Journal Articles

2024

IEEE TOSP

Learnable Filters for Geometric Scattering Modules

Alexander Tong, Frederik Wenkel, Dhananjay Bhaskar, Kincaid Macdonald, Jackson Grady, Michael Perlmutter, Smita Krishnaswamy, and Guy Wolf

IEEE Transactions on Signal Processing, 2024

Conference Name: IEEE Transactions on Signal Processing

Abs Code

We propose a new graph neural network (GNN) module, based on relaxations of recently proposed geometric scattering transforms, which consist of a cascade of graph wavelet filters. Our learnable geometric scattering (LEGS) module enables adaptive tuning of the wavelets to encourage band-pass features to emerge in learned representations. The incorporation of our LEGS-module in GNNs enables the learning of longer-range graph relations compared to many popular GNNs, which often rely on encoding graph structure via smoothness or similarity between neighbors. Further, its wavelet priors result in simplified architectures with significantly fewer learned parameters compared to competing GNNs. We demonstrate the predictive performance of LEGS-based networks on graph classification benchmarks, as well as the descriptive quality of their learned features in biochemical graph data exploration tasks. Our results show that LEGS-based networks match or outperforms popular GNNs, as well as the original geometric scattering construction, on many datasets, in particular in biochemical domains, while retaining certain mathematical properties of handcrafted (non-learned) geometric scattering.
Sci Rep

Dissecting glial scar formation by spatial point pattern and topological data analysis

Daniel Manrique-Castano, Dhananjay Bhaskar, and Ayman ElAli

Scientific Reports, Aug 2024

Publisher: Nature Publishing Group

Abs Code

Glial scar formation represents a fundamental response to central nervous system (CNS) injuries. It is mainly characterized by a well-defined spatial rearrangement of reactive astrocytes and microglia. The mechanisms underlying glial scar formation have been extensively studied, yet quantitative descriptors of the spatial arrangement of reactive glial cells remain limited. Here, we present a novel approach using point pattern analysis (PPA) and topological data analysis (TDA) to quantify spatial patterns of reactive glial cells after experimental ischemic stroke in mice. We provide open and reproducible tools using R and Julia to quantify spatial intensity, cell covariance and conditional distribution, cell-to-cell interactions, and short/long-scale arrangement, which collectively disentangle the arrangement patterns of the glial scar. This approach unravels a substantial divergence in the distribution of GFAP+ and IBA1+ cells after injury that conventional analysis methods cannot fully characterize. PPA and TDA are valuable tools for studying the complex spatial arrangement of reactive glia and other nervous cells following CNS injuries and have potential applications for evaluating glial-targeted restorative therapies.

2023

JCB

Cell cycle controls long-range calcium signaling in the regenerating epidermis

Jessica L. Moore, Dhananjay Bhaskar, Feng Gao, Catherine Matte-Martone, Shuangshuang Du, Elizabeth Lathrop, Smirthy Ganesan, Lin Shao, Rachael Norris, Nil Campamà Sanz, Karl Annusver, Maria Kasper, Andy Cox, Caroline Hendry, Bastian Rieck, Smita Krishnaswamy, and Valentina Greco

The Journal of Cell Biology, Jul 2023

Abs

Skin homeostasis is maintained by stem cells, which must communicate to balance their regenerative behaviors. Yet, how adult stem cells signal across regenerative tissue remains unknown due to challenges in studying signaling dynamics in live mice. We combined live imaging in the mouse basal stem cell layer with machine learning tools to analyze patterns of Ca2+ signaling. We show that basal cells display dynamic intercellular Ca2+ signaling among local neighborhoods. We find that these Ca2+ signals are coordinated across thousands of cells and that this coordination is an emergent property of the stem cell layer. We demonstrate that G2 cells are required to initiate normal levels of Ca2+ signaling, while connexin43 connects basal cells to orchestrate tissue-wide coordination of Ca2+ signaling. Lastly, we find that Ca2+ signaling drives cell cycle progression, revealing a communication feedback loop. This work provides resolution into how stem cells at different cell cycle stages coordinate tissue-wide signaling during epidermal regeneration.
NPJ

Topological data analysis of spatial patterning in heterogeneous cell populations: clustering and sorting with varying cell-cell adhesion

Dhananjay Bhaskar, William Y. Zhang, Alexandria Volkening, Björn Sandstede, and Ian Y. Wong

npj Systems Biology and Applications, Sep 2023

Number: 1 Publisher: Nature Publishing Group

Abs

Different cell types aggregate and sort into hierarchical architectures during the formation of animal tissues. The resulting spatial organization depends (in part) on the strength of adhesion of one cell type to itself relative to other cell types. However, automated and unsupervised classification of these multicellular spatial patterns remains challenging, particularly given their structural diversity and biological variability. Recent developments based on topological data analysis are intriguing to reveal similarities in tissue architecture, but these methods remain computationally expensive. In this article, we show that multicellular patterns organized from two interacting cell types can be efficiently represented through persistence images. Our optimized combination of dimensionality reduction via autoencoders, combined with hierarchical clustering, achieved high classification accuracy for simulations with constant cell numbers. We further demonstrate that persistence images can be normalized to improve classification for simulations with varying cell numbers due to proliferation. Finally, we systematically consider the importance of incorporating different topological features as well as information about each cell type to improve classification accuracy. We envision that topological machine learning based on persistence images will enable versatile and robust classification of complex tissue architectures that occur in development and disease.

2022

Nature

Transformer-based protein generation with regularized latent space optimization

Egbert Castro, Abhinav Godavarthi, Julian Rubinfien, Kevin Givechian, Dhananjay Bhaskar, and Smita Krishnaswamy

Nature Machine Intelligence, Oct 2022

Number: 10 Publisher: Nature Publishing Group

Abs

The development of powerful natural language models has improved the ability to learn meaningful representations of protein sequences. In addition, advances in high-throughput mutagenesis, directed evolution and next-generation sequencing have allowed for the accumulation of large amounts of labelled fitness data. Leveraging these two trends, we introduce Regularized Latent Space Optimization (ReLSO), a deep transformer-based autoencoder, which features a highly structured latent space that is trained to jointly generate sequences as well as predict fitness. Through regularized prediction heads, ReLSO introduces a powerful protein sequence encoder and a novel approach for efficient fitness landscape traversal. Using ReLSO, we explicitly model the sequence–function landscape of large labelled datasets and generate new molecules by optimizing within the latent space using gradient-based methods. We evaluate this approach on several publicly available protein datasets, including variant sets of anti-ranibizumab and green fluorescent protein. We observe a greater sequence optimization efficiency (increase in fitness per optimization step) using ReLSO compared with other approaches, where ReLSO more robustly generates high-fitness sequences. Furthermore, the attention-based relationships learned by the jointly trained ReLSO models provide a potential avenue towards sequence-level fitness attribution information.

2021

RSC

Topological data analysis of collective and individual epithelial cells using persistent homology of loops

Dhananjay Bhaskar, William Y. Zhang, and Ian Y. Wong

Soft Matter, May 2021

Publisher: The Royal Society of Chemistry

Abs PDF

Interacting, self-propelled particles such as epithelial cells can dynamically self-organize into complex multicellular patterns, which are challenging to classify without a priori information. Classically, different phases and phase transitions have been described based on local ordering, which may not capture structural features at larger length scales. Instead, topological data analysis (TDA) determines the stability of spatial connectivity at varying length scales (i.e. persistent homology), and can compare different particle configurations based on the “cost” of reorganizing one configuration into another. Here, we demonstrate a topology-based machine learning approach for unsupervised profiling of individual and collective phases based on large-scale loops. We show that these topological loops (i.e. dimension 1 homology) are robust to variations in particle number and density, particularly in comparison to connected components (i.e. dimension 0 homology). We use TDA to map out phase diagrams for simulated particles with varying adhesion and propulsion, at constant population size as well as when proliferation is permitted. Next, we use this approach to profile our recent experiments on the clustering of epithelial cells in varying growth factor conditions, which are compared to our simulations. Finally, we characterize the robustness of this approach at varying length scales, with sparse sampling, and over time. Overall, we envision TDA will be broadly applicable as a model-agnostic approach to analyze active systems with varying population size, from cytoskeletal motors to motile cells to flocking or swarming animals.

2019

AIP

Analyzing collective motion with machine learning and topology

Dhananjay Bhaskar, Angelika Manhart, Jesse Milzman, John T. Nardini, Kathleen M. Storey, Chad M. Topaz, and Lori Ziegelmeier

Chaos: An Interdisciplinary Journal of Nonlinear Science, Dec 2019

Publisher: American Institute of Physics

Abs Code

We use topological data analysis and machine learning to study a seminal model of collective motion in biology [M. R. D’Orsogna et al., Phys. Rev. Lett. 96, 104302 (2006)]. This model describes agents interacting nonlinearly via attractive-repulsive social forces and gives rise to collective behaviors such as flocking and milling. To classify the emergent collective motion in a large library of numerical simulations and to recover model parameters from the simulation data, we apply machine learning techniques to two different types of input. First, we input time series of order parameters traditionally used in studies of collective motion. Second, we input measures based on topology that summarize the time-varying persistent homology of simulation data over multiple scales. This topological approach does not require prior knowledge of the expected patterns. For both unsupervised and supervised machine learning methods, the topological approach outperforms the one that is based on traditional order parameters.
PNAS

Motility-limited aggregation of mammary epithelial cells into fractal-like clusters

Susan E. Leggett, Zachary J. Neronha, Dhananjay Bhaskar, Jea Yun Sim, Theodora Myrto Perdikari, and Ian Y. Wong

Proceedings of the National Academy of Sciences of the United States of America, Aug 2019

Abs

Migratory cells transition between dispersed individuals and multicellular collectives during development, wound healing, and cancer. These transitions are associated with coordinated behaviors as well as arrested motility at high cell densities, but remain poorly understood at lower cell densities. Here, we show that dispersed mammary epithelial cells organize into arrested, fractal-like clusters at low density in reduced epidermal growth factor (EGF). These clusters exhibit a branched architecture with a fractal dimension of [Formula: see text], reminiscent of diffusion-limited aggregation of nonliving colloidal particles. First, cells display diminished motility in reduced EGF, which permits irreversible adhesion upon cell-cell contact. Subsequently, leader cells emerge that guide collectively migrating strands and connect clusters into space-filling networks. Thus, this living system exhibits gelation-like arrest at low cell densities, analogous to the glass-like arrest of epithelial monolayers at high cell densities. We quantitatively capture these behaviors with a jamming-like phase diagram based on local cell density and EGF. These individual to collective transitions represent an intriguing link between living and nonliving systems, with potential relevance for epithelial morphogenesis into branched architectures.
ACS

Breast Cancer Cells Transition from Mesenchymal to Amoeboid Migration in Tunable Three-Dimensional Silk-Collagen Hydrogels

Amanda S. Khoo, Thomas M. Valentin, Susan E. Leggett, Dhananjay Bhaskar, Elisa M. Bye, Shoham Benmelech, Blanche C. Ip, and Ian Y. Wong

ACS biomaterials science & engineering, Sep 2019

Abs

Invading cancer cells adapt their migration phenotype in response to mechanical and biochemical cues from the extracellular matrix. For instance, mesenchymal migration is associated with strong cell-matrix adhesions and an elongated morphology, while amoeboid migration is associated with minimal cell-matrix adhesions and a rounded morphology. However, it remains challenging to elucidate the role of matrix mechan-ics and biochemistry, since these are both dependent on ECM protein concentration. Here, we demonstrate a composite silk fibroin and collagen I hydrogel where stiffness and microstructure can be systematically tuned over a wide range. Using an overlay assay geometry, we show that the invasion of metastatic breast cancer cells exhibits a biphasic dependence on silk fibroin concentration at fixed collagen I concentration, first increasing as the hydrogel stiffness increases, then decreasing as the pore size of silk fibroin decreases. Indeed, mesenchymal morphology exhibits a similar biphasic depen-dence on silk fibroin concentration, while amoeboid morphologies were favored when cell-matrix adhesions were less effective. We used exogenous biochemical treatment to perturb cells towards increased contractility and a mesenchymal morphology, as well as to disrupt cytoskeletal function and promote an amoeboid morphology. Overall, we envision that this tunable biomaterial platform in a 96-well plate format will be widely applicable to screen cancer cell migration against combinations of designer biomaterials and targeted inhibitors.
RSC

3D printed self-adhesive PEGDA–PAA hydrogels as modular components for soft actuators and microfluidics

Thomas M. Valentin, Eric M. DuBois, Catherine E. Machnicki, Dhananjay Bhaskar, Francis R. Cui, and Ian Y. Wong

Polymer Chemistry, Apr 2019

Publisher: The Royal Society of Chemistry

Abs

Hydrogel building blocks that are stimuli-responsive and self-adhesive could be utilized as a simple “do-it-yourself” construction set for soft machines and microfluidic devices. However, conventional covalently-crosslinked hydrogels are unsuitable since they are as static materials with poor interfacial adhesion. In this article, we demonstrate ion-responsive interchangeable parts based on composite hydrogels that incorporate both covalent and ionic crosslinking. We use light-directed 3D printing to covalently-crosslink poly(ethylene glycol) diacrylate in the presence of anionic poly(acrylic acid) of much higher molecular weight. The addition of trivalent cations acts to crosslink the anionic polymer chains together. Using high cation concentrations drives strong crosslinking, which can result in dramatic hydrogel contraction. Mismatched contraction of layered ion-responsive and non-ion-responsive hydrogels can control bending and twisting actuation, which is utilized for a gripping device. Alternatively, moderate cation concentrations permit strong self-adhesion between hydrogel surfaces. LEGO-like hydrogel blocks with internal channels and external mechanical connectors can be stacked into complex microfluidic device geometries including serpentine micromixers and multilevel architectures. This approach enables “plug-and-play” hydrogel parts for ionic soft machines that mimic actuation, sensing, and fluid transport in living systems.

2018

Phys. Biol.

Coupling mechanical tension and GTPase signaling to generate cell and tissue dynamics

Cole Zmurchok, Dhananjay Bhaskar, and Leah Edelstein-Keshet

Physical Biology, Apr 2018

Abs

Regulators of the actin cytoskeleton such Rho GTPases can modulate forces developed in cells by promoting actomyosin contraction. At the same time, through mechanosensing, tension is known to affect the activity of Rho GTPases. What happens when these effects act in concert? Using a minimal model (1 GTPase coupled to a Kelvin–Voigt element), we show that two-way feedback between signaling (‘RhoA’) and mechanical tension (stretching) leads to a spectrum of cell behaviors, including contracted or relaxed cells, and cells that oscillate between these extremes. When such ‘model cells’ are connected to one another in a row or in a 2D sheet (‘epithelium’), we observe waves of contraction/relaxation and GTPase activity sweeping through the tissue. The minimal model lends itself to full bifurcation analysis, and suggests a mechanism that explains behavior observed in the context of development and collective cell behavior.

2017

PLOS

Polarization and migration in the zebrafish posterior lateral line system

Hildur Knutsdottir, Cole Zmurchok, Dhananjay Bhaskar, Eirikur Palsson, Damian Dalle Nogare, Ajay B. Chitnis, and Leah Edelstein-Keshet

PLOS Computational Biology, Apr 2017

Publisher: Public Library of Science

Abs

Collective cell migration plays an important role in development. Here, we study the posterior lateral line primordium (PLLP) a group of about 100 cells, destined to form sensory structures, that migrates from head to tail in the zebrafish embryo. We model mutually inhibitory FGF-Wnt signalling network in the PLLP and link tissue subdivision (Wnt receptor and FGF receptor activity domains) to receptor-ligand parameters. We then use a 3D cell-based simulation with realistic cell-cell adhesion, interaction forces, and chemotaxis. Our model is able to reproduce experimentally observed motility with leading cells migrating up a gradient of CXCL12a, and trailing (FGF receptor active) cells moving actively by chemotaxis towards FGF ligand secreted by the leading cells. The 3D simulation framework, combined with experiments, allows an investigation of the role of cell division, chemotaxis, adhesion, and other parameters on the shape and speed of the PLLP. The 3D model demonstrates reasonable behaviour of control as well as mutant phenotypes.

Peer-Reviewed CS Conferences

2024

LoG

Inferring Dynamic Regulatory Interaction Graphs From Time Series Data With Perturbations

Dhananjay Bhaskar, Daniel Sumner Magruder, Matheo Morales, Edward De Brouwer, Aarthi Venkat, Frederik Wenkel, James Noonan, Guy Wolf, Natalia Ivanova, and Smita Krishnaswamy

In Proceedings of the Second Learning on Graphs Conference, Apr 2024

ISSN: 2640-3498

Abs

Complex systems are characterized by intricate interactions between entities that evolve dynamically over time. Accurate inference of these dynamic relationships is crucial for understanding and predicting system behavior. In this paper, we propose Regulatory Temporal Interaction Network Inference (RiTINI) for inferring time-varying interaction graphs in complex systems using a novel combination of space-and-time graph attentions and graph neural ordinary differential equations (ODEs). RiTINI leverages time-lapse signals on a graph prior, as well as perturbations of signals at various nodes in order to effectively capture the dynamics of the underlying system. This approach is distinct from traditional causal inference networks, which are limited to inferring acyclic and static graphs. In contrast, RiTINI can infer cyclic, directed, and time-varying graphs, providing a more comprehensive and accurate representation of complex systems. The graph attention mechanism in RiTINI allows the model to adaptively focus on the most relevant interactions in time and space, while the graph neural ODEs enable continuous-time modeling of the system’s dynamics. We evaluate RiTINI’s performance on simulations of dynamical systems, neuronal networks, and gene regulatory networks, demonstrating its state-of-the-art capability in inferring interaction graphs compared to previous methods.

2023

IEEE MLSP

A Flow Artist for High-Dimensional Cellular Data

Kincaid MacDonald, Dhananjay Bhaskar, Guy Thampakkul, Nhi Nguyen, Joia Zhang, Michael Perlmutter, Ian Adelstein, and Smita Krishnaswamy

In 2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP), Apr 2023

Abs

We consider the problem of embedding point cloud data sampled from an underlying manifold with an associated flow or velocity. Such data arises in many contexts where static snapshots of dynamic entities are measured, including in high-throughput biology such as single-cell transcriptomics. Existing embedding techniques either do not utilize velocity information or embed the coordinates and velocities independently, i.e., they either impose velocities on top of an existing point embedding or embed points within a prescribed vector field. Here we present FlowArtist, a neural network that embeds points while jointly learning a vector field around the points. The combination allows FlowArtist to better separate and visualize velocity-informed structures. Our results, on toy datasets and single-cell RNA velocity data, illustrate the value of utilizing coordinate and velocity information in tandem for embedding and visualizing high-dimensional data.

2022

IEEE MLSP

Molecular Graph Generation via Geometric Scattering

Dhananjay Bhaskar, Jackson Grady, Egbert Castro, Michael Perlmutter, and Smita Krishnaswamy

In 2022 IEEE 32nd International Workshop on Machine Learning for Signal Processing (MLSP), Aug 2022

ISSN: 2161-0371

Abs

Although existing deep learning models perform remarkably well at predicting physicochemical properties and binding affinities, the generation of new molecules with optimized properties remains challenging. Inherently, most GNNs perform poorly in whole-graph representation due to the limitations of the message-passing paradigm. Furthermore, step-by-step graph generation frameworks that use reinforcement learning or other sequential processing can be slow and result in a high proportion of invalid molecules with substantial post-processing needed in order to generate valid molecules. To address these issues, we propose a representation-first approach to molecular graph generation. We guide the latent representation of an autoencoder by capturing graph structure information with the geometric scattering transform and apply penalties that organize the representation by molecular properties. We show that this highly structured latent space can be directly used for molecular graph generation by the use of a GAN. We demonstrate that our architecture learns meaningful representations of drug datasets and provides a platform for drug synthesis using publicly available ZINC and BindingDB datasets.
NeurIPS

Diffusion Curvature for Estimating Local Curvature in High Dimensional Data

Dhananjay Bhaskar, Kincaid MacDonald, Oluwadamilola Fasina, Dawson Thomas, Bastian Rieck, Ian Adelstein, and Smita Krishnaswamy

Advances in Neural Information Processing Systems, Dec 2022

Reviews

2023

Review

Multiscale geometric and topological analyses for characterizing and predicting immune responses from single cell data

Aarthi Venkat, Dhananjay Bhaskar, and Smita Krishnaswamy

Trends in Immunology, Jul 2023

Publisher: Elsevier

2022

Review

Current trends in artificial intelligence in reproductive endocrinology

Dhananjay Bhaskar, T. Arthur Chang, and Shunping Wang

Current Opinion in Obstetrics & Gynecology, Aug 2022

Abs

In this review, we discuss various applications of artificial intelligence in different areas of reproductive medicine. We summarize the current findings with their potentials and limitations, and also discuss the future direction for research and clinical applications.
Review

The need for speed: Migratory cells in tight spaces boost their molecular clock

Dhananjay Bhaskar, Alex M. Hruska, and Ian Y. Wong

Cell Systems, Jul 2022

Abs

Cells migrating in spatial confinement exhibit higher intracellular calcium levels, which increases the oscillation frequency of a “molecular clock” that synchronizes guanine nucleotide exchange factor GEF-H1 and microtubule polymerization for more frequent bursts of speed.

Preprints

2024

arXiv

ProtSCAPE: Mapping the landscape of protein conformations in molecular dynamics

Siddharth Viswanath, Dhananjay Bhaskar, David R. Johnson, Joao Felipe Rocha, Egbert Castro, Jackson D. Grady, Alex T. Grigas, Michael A. Perlmutter, Corey S. O’Hern, and Smita Krishnaswamy

Oct 2024

arXiv:2410.20317

Abs

Understanding the dynamic nature of protein structures is essential for comprehending their biological functions. While significant progress has been made in predicting static folded structures, modeling protein motions on microsecond to millisecond scales remains challenging. To address these challenges, we introduce a novel deep learning architecture, Protein Transformer with Scattering, Attention, and Positional Embedding (ProtSCAPE), which leverages the geometric scattering transform alongside transformer-based attention mechanisms to capture protein dynamics from molecular dynamics (MD) simulations. ProtSCAPE utilizes the multi-scale nature of the geometric scattering transform to extract features from protein structures conceptualized as graphs and integrates these features with dual attention structures that focus on residues and amino acid signals, generating latent representations of protein trajectories. Furthermore, ProtSCAPE incorporates a regression head to enforce temporally coherent latent representations.
bioRxiv

Neuro-GSTH: A Geometric Scattering and Persistent Homology Framework for Uncovering Spatiotemporal Signatures in Neural Activity

Dhananjay Bhaskar, Jessica Moore, Yanlei Zhang, Feng Gao, Bastian Rieck, Guy Wolf, Helen Pushkarskaya, Firas Khasawneh, Elizabeth Munch, Valentina Greco, Christopher Pittenger, and Smita Krishnaswamy

Oct 2024

Abs

Understanding how neurons communicate and coordinate their activity is essential for unraveling the brain’s complex functionality. To analyze the intricate spatiotemporal dynamics of neural signaling, we developed Geometric Scattering Trajectory Homology (neuro-GSTH), a novel framework that captures time-evolving neural signals and encodes them into low-dimensional representations. GSTH integrates geometric scattering transforms, which extract multiscale features from brain signals modeled on anatomical graphs, with t-PHATE, a manifold learning method that maps the temporal evolution of neural activity. Topological descriptors from computational homology are then applied to characterize the global structure of these neural trajectories, enabling the quantification and differentiation of spatiotemporal brain dynamics. We demonstrate the power of neuro-GSTH in neuroscience by applying it to both simulated and biological neural datasets. First, we used neuro-GSTH to analyze neural oscillatory behavior in the Kuramoto model, revealing its capacity to track the synchronization of neural circuits as coupling strength increases. Next, we applied neuro-GSTH to neural recordings from the visual cortex of mice, where it accurately reconstructed visual stimulus patterns such as sinusoidal gratings. Neuro-GSTH-derived neural trajectories enabled precise classification of stimulus properties like spatial frequency and orientation, significantly outperforming traditional methods in capturing the underlying neural dynamics. These findings demonstrate that neuro-GSTH effectively identifies neural motifs—distinct patterns of spatiotemporal activity—providing a powerful tool for decoding brain activity across diverse tasks, sensory inputs, and neurological disorders. Neuro-GSTH thus offers new insights into neural communication and dynamics, advancing our ability to map and understand complex brain functions.
bioRxiv

Generative modeling of biological shapes and images using a probabilistic α-shape sampler

Emily T. Winn-Nuñez, Hadley Witt, Dhananjay Bhaskar, Ryan Y. Huang, Jonathan S. Reichner, Ian Y. Wong, and Lorin Crawford

Jan 2024

Abs

Understanding morphological variation is an important task in many areas of computational biology. Recent studies have focused on developing computational tools for the task of sub-image selection which aims at identifying structural features that best describe the variation between classes of shapes. A major part in assessing the utility of these approaches is to demonstrate their performance on both simulated and real datasets. However, when creating a model for shape statistics, real data can be difficult to access and the sample sizes for these data are often small due to them being expensive to collect. Meanwhile, the current landscape of generative models for shapes has been mostly limited to approaches that use black-box inference—making it difficult to systematically assess the power and calibration of sub-image models. In this paper, we introduce the α-shape sampler: a probabilistic framework for generating realistic 2D and 3D shapes based on probability distributions which can be learned from real data. We demonstrate our framework using proof-of-concept examples and in two real applications in biology where we generate (i) 2D images of healthy and septic neutrophils and (ii) 3D computed tomography (CT) scans of primate mandibular molars. The α-shape sampler R package is open-source and can be downloaded at https://github.com/lcrawlab/ashapesampler., Using shapes and images to understand genotypic and phenotypic variation has proven to be an effective strategy in many biological applications. Unfortunately, shape data can be expensive to collect and, as a result, sample sizes for analyses are often small. Despite methodological advancements in shape statistics and machine learning, benchmarking standards for evaluating new computational tools via data simulation is still underdeveloped. In this paper, we present a probability-based pipeline called the α-shape sampler which has the flexibility to generate new and unobserved shapes based on an input set of data. We extensively evaluate the generative capabilities of our pipeline using 2D cellular images of neutrophils and 3D mandibular molars from two different suborders of primates.
bioRxiv

NeuroSCAN: Exploring Neurodevelopment via Spatiotemporal Collation of Anatomical Networks

Noelle L. Koonce, Sarah E. Emerson, Dhananjay Bhaskar, Manik Kuchroo, Mark W. Moyle, Pura Arroyo-Morales, Nabor Vázquez Martínez, Smita Krishnaswamy, William Mohler, and Daniel Colón-Ramos

Aug 2024

Abs

Volume electron microscopy (vEM) datasets such as those generated for connectome studies allow nanoscale quantifications and comparisons of the cell biological features underpinning circuit architectures. Quantifications of cell biological relationships in the connectome result in rich multidimensional datasets that benefit from data science approaches, including dimensionality reduction and integrated graphical representations of neuronal relationships. We developed NeuroSCAN, an online open- source platform that bridges sophisticated graph analytics from data science approaches with the underlying cell biological features in the connectome. We apply NeuroSCAN to a complete published record of C. elegans brain neuropils and demonstrate how these integrated representations of neuronal relationships facilitate comparisons across connectomes, catalyzing new insights on the structure-function of the circuits and their changes during development. NeuroSCAN is designed for intuitive examination and comparisons across connectomes, enabling synthesis of knowledge from high-level abstractions of neuronal relationships derived from data science techniques to the detailed identification of the cell biological features underpinning these abstractions.
arXiv

Latent Representation Learning for Multimodal Brain Activity Translation

Arman Afrasiyabi, Dhananjay Bhaskar, Erica L. Busch, Laurent Caplette, Rahul Singh, Guillaume Lajoie, Nicholas B. Turk-Browne, and Smita Krishnaswamy

Sep 2024

arXiv:2409.18462 [cs, q-bio]

Abs

Neuroscience employs diverse neuroimaging techniques, each offering distinct insights into brain activity, from electrophysiological recordings such as EEG, which have high temporal resolution, to hemodynamic modalities such as fMRI, which have increased spatial precision. However, integrating these heterogeneous data sources remains a challenge, which limits a comprehensive understanding of brain function. We present the Spatiotemporal Alignment of Multimodal Brain Activity (SAMBA) framework, which bridges the spatial and temporal resolution gaps across modalities by learning a unified latent space free of modality-specific biases. SAMBA introduces a novel attention-based wavelet decomposition for spectral filtering of electrophysiological recordings, graph attention networks to model functional connectivity between functional brain units, and recurrent layers to capture temporal autocorrelations in brain signal. We show that the training of SAMBA, aside from achieving translation, also learns a rich representation of brain information processing. We showcase this classify external stimuli driving brain activity from the representation learned in hidden layers of SAMBA, paving the way for broad downstream applications in neuroscience research and clinical contexts.
arXiv

Looking through the mind’s eye via multimodal encoder-decoder networks

Arman Afrasiyabi, Erica Busch, Rahul Singh, Dhananjay Bhaskar, Laurent Caplette, Nicholas Turk-Browne, and Smita Krishnaswamy

Sep 2024

arXiv:2410.00047 [cs, eess, q-bio]

Abs

In this work, we explore the decoding of mental imagery from subjects using their fMRI measurements. In order to achieve this decoding, we first created a mapping between a subject’s fMRI signals elicited by the videos the subjects watched. This mapping associates the high dimensional fMRI activation states with visual imagery. Next, we prompted the subjects textually, primarily with emotion labels which had no direct reference to visual objects. Then to decode visual imagery that may have been in a person’s mind’s eye, we align a latent representation of these fMRI measurements with a corresponding video-fMRI based on textual labels given to the videos themselves. This alignment has the effect of overlapping the video fMRI embedding with the text-prompted fMRI embedding, thus allowing us to use our fMRI-to-video mapping to decode. Additionally, we enhance an existing fMRI dataset, initially consisting of data from five subjects, by including recordings from three more subjects gathered by our team. We demonstrate the efficacy of our model on this augmented dataset both in accurately creating a mapping, as well as in plausibly decoding mental imagery.

2023

arXiv

Graph topological property recovery with heat and wave dynamics-based features on graphs

Dhananjay Bhaskar, Yanlei Zhang, Charles Xu, Xingzhi Sun, Oluwadamilola Fasina, Guy Wolf, Maximilian Nickel, Michael Perlmutter, and Smita Krishnaswamy

Sep 2023

arXiv:2309.09924 [cs, eess, stat]

Abs

In this paper, we propose Graph Differential Equation Network (GDeNet), an approach that harnesses the expressive power of solutions to PDEs on a graph to obtain continuous node- and graph-level representations for various downstream tasks. We derive theoretical results connecting the dynamics of heat and wave equations to the spectral properties of the graph and to the behavior of continuous-time random walks on graphs. We demonstrate experimentally that these dynamics are able to capture salient aspects of graph geometry and topology by recovering generating parameters of random graphs, Ricci curvature, and persistent homology. Furthermore, we demonstrate the superior performance of GDeNet on real-world datasets including citation graphs, drug-like molecules, and proteins.