Publications
Journal Articles
2024
- IEEE TOSPLearnable Filters for Geometric Scattering ModulesAlexander Tong, Frederik Wenkel, Dhananjay Bhaskar, Kincaid Macdonald, Jackson Grady, Michael Perlmutter, Smita Krishnaswamy, and Guy WolfIEEE Transactions on Signal Processing, 2024Conference Name: IEEE Transactions on Signal Processing
We propose a new graph neural network (GNN) module, based on relaxations of recently proposed geometric scattering transforms, which consist of a cascade of graph wavelet filters. Our learnable geometric scattering (LEGS) module enables adaptive tuning of the wavelets to encourage band-pass features to emerge in learned representations. The incorporation of our LEGS-module in GNNs enables the learning of longer-range graph relations compared to many popular GNNs, which often rely on encoding graph structure via smoothness or similarity between neighbors. Further, its wavelet priors result in simplified architectures with significantly fewer learned parameters compared to competing GNNs. We demonstrate the predictive performance of LEGS-based networks on graph classification benchmarks, as well as the descriptive quality of their learned features in biochemical graph data exploration tasks. Our results show that LEGS-based networks match or outperforms popular GNNs, as well as the original geometric scattering construction, on many datasets, in particular in biochemical domains, while retaining certain mathematical properties of handcrafted (non-learned) geometric scattering.
- Sci RepDissecting glial scar formation by spatial point pattern and topological data analysisDaniel Manrique-Castano, Dhananjay Bhaskar, and Ayman ElAliScientific Reports, Aug 2024Publisher: Nature Publishing Group
Glial scar formation represents a fundamental response to central nervous system (CNS) injuries. It is mainly characterized by a well-defined spatial rearrangement of reactive astrocytes and microglia. The mechanisms underlying glial scar formation have been extensively studied, yet quantitative descriptors of the spatial arrangement of reactive glial cells remain limited. Here, we present a novel approach using point pattern analysis (PPA) and topological data analysis (TDA) to quantify spatial patterns of reactive glial cells after experimental ischemic stroke in mice. We provide open and reproducible tools using R and Julia to quantify spatial intensity, cell covariance and conditional distribution, cell-to-cell interactions, and short/long-scale arrangement, which collectively disentangle the arrangement patterns of the glial scar. This approach unravels a substantial divergence in the distribution of GFAP+ and IBA1+ cells after injury that conventional analysis methods cannot fully characterize. PPA and TDA are valuable tools for studying the complex spatial arrangement of reactive glia and other nervous cells following CNS injuries and have potential applications for evaluating glial-targeted restorative therapies.
2023
- JCBCell cycle controls long-range calcium signaling in the regenerating epidermisJessica L. Moore, Dhananjay Bhaskar, Feng Gao, Catherine Matte-Martone, Shuangshuang Du, Elizabeth Lathrop, Smirthy Ganesan, Lin Shao, Rachael Norris, Nil Campamà Sanz, Karl Annusver, Maria Kasper, Andy Cox, Caroline Hendry, Bastian Rieck, Smita Krishnaswamy, and Valentina GrecoThe Journal of Cell Biology, Jul 2023
Skin homeostasis is maintained by stem cells, which must communicate to balance their regenerative behaviors. Yet, how adult stem cells signal across regenerative tissue remains unknown due to challenges in studying signaling dynamics in live mice. We combined live imaging in the mouse basal stem cell layer with machine learning tools to analyze patterns of Ca2+ signaling. We show that basal cells display dynamic intercellular Ca2+ signaling among local neighborhoods. We find that these Ca2+ signals are coordinated across thousands of cells and that this coordination is an emergent property of the stem cell layer. We demonstrate that G2 cells are required to initiate normal levels of Ca2+ signaling, while connexin43 connects basal cells to orchestrate tissue-wide coordination of Ca2+ signaling. Lastly, we find that Ca2+ signaling drives cell cycle progression, revealing a communication feedback loop. This work provides resolution into how stem cells at different cell cycle stages coordinate tissue-wide signaling during epidermal regeneration.
- NPJTopological data analysis of spatial patterning in heterogeneous cell populations: clustering and sorting with varying cell-cell adhesionDhananjay Bhaskar, William Y. Zhang, Alexandria Volkening, Björn Sandstede, and Ian Y. Wongnpj Systems Biology and Applications, Sep 2023Number: 1 Publisher: Nature Publishing Group
Different cell types aggregate and sort into hierarchical architectures during the formation of animal tissues. The resulting spatial organization depends (in part) on the strength of adhesion of one cell type to itself relative to other cell types. However, automated and unsupervised classification of these multicellular spatial patterns remains challenging, particularly given their structural diversity and biological variability. Recent developments based on topological data analysis are intriguing to reveal similarities in tissue architecture, but these methods remain computationally expensive. In this article, we show that multicellular patterns organized from two interacting cell types can be efficiently represented through persistence images. Our optimized combination of dimensionality reduction via autoencoders, combined with hierarchical clustering, achieved high classification accuracy for simulations with constant cell numbers. We further demonstrate that persistence images can be normalized to improve classification for simulations with varying cell numbers due to proliferation. Finally, we systematically consider the importance of incorporating different topological features as well as information about each cell type to improve classification accuracy. We envision that topological machine learning based on persistence images will enable versatile and robust classification of complex tissue architectures that occur in development and disease.
2022
- NatureTransformer-based protein generation with regularized latent space optimizationEgbert Castro, Abhinav Godavarthi, Julian Rubinfien, Kevin Givechian, Dhananjay Bhaskar, and Smita KrishnaswamyNature Machine Intelligence, Oct 2022Number: 10 Publisher: Nature Publishing Group
The development of powerful natural language models has improved the ability to learn meaningful representations of protein sequences. In addition, advances in high-throughput mutagenesis, directed evolution and next-generation sequencing have allowed for the accumulation of large amounts of labelled fitness data. Leveraging these two trends, we introduce Regularized Latent Space Optimization (ReLSO), a deep transformer-based autoencoder, which features a highly structured latent space that is trained to jointly generate sequences as well as predict fitness. Through regularized prediction heads, ReLSO introduces a powerful protein sequence encoder and a novel approach for efficient fitness landscape traversal. Using ReLSO, we explicitly model the sequence–function landscape of large labelled datasets and generate new molecules by optimizing within the latent space using gradient-based methods. We evaluate this approach on several publicly available protein datasets, including variant sets of anti-ranibizumab and green fluorescent protein. We observe a greater sequence optimization efficiency (increase in fitness per optimization step) using ReLSO compared with other approaches, where ReLSO more robustly generates high-fitness sequences. Furthermore, the attention-based relationships learned by the jointly trained ReLSO models provide a potential avenue towards sequence-level fitness attribution information.
2021
- RSCTopological data analysis of collective and individual epithelial cells using persistent homology of loopsDhananjay Bhaskar, William Y. Zhang, and Ian Y. WongSoft Matter, May 2021Publisher: The Royal Society of Chemistry
Interacting, self-propelled particles such as epithelial cells can dynamically self-organize into complex multicellular patterns, which are challenging to classify without a priori information. Classically, different phases and phase transitions have been described based on local ordering, which may not capture structural features at larger length scales. Instead, topological data analysis (TDA) determines the stability of spatial connectivity at varying length scales (i.e. persistent homology), and can compare different particle configurations based on the “cost” of reorganizing one configuration into another. Here, we demonstrate a topology-based machine learning approach for unsupervised profiling of individual and collective phases based on large-scale loops. We show that these topological loops (i.e. dimension 1 homology) are robust to variations in particle number and density, particularly in comparison to connected components (i.e. dimension 0 homology). We use TDA to map out phase diagrams for simulated particles with varying adhesion and propulsion, at constant population size as well as when proliferation is permitted. Next, we use this approach to profile our recent experiments on the clustering of epithelial cells in varying growth factor conditions, which are compared to our simulations. Finally, we characterize the robustness of this approach at varying length scales, with sparse sampling, and over time. Overall, we envision TDA will be broadly applicable as a model-agnostic approach to analyze active systems with varying population size, from cytoskeletal motors to motile cells to flocking or swarming animals.
2019
- AIPAnalyzing collective motion with machine learning and topologyDhananjay Bhaskar, Angelika Manhart, Jesse Milzman, John T. Nardini, Kathleen M. Storey, Chad M. Topaz, and Lori ZiegelmeierChaos: An Interdisciplinary Journal of Nonlinear Science, Dec 2019Publisher: American Institute of Physics
We use topological data analysis and machine learning to study a seminal model of collective motion in biology [M. R. D’Orsogna et al., Phys. Rev. Lett. 96, 104302 (2006)]. This model describes agents interacting nonlinearly via attractive-repulsive social forces and gives rise to collective behaviors such as flocking and milling. To classify the emergent collective motion in a large library of numerical simulations and to recover model parameters from the simulation data, we apply machine learning techniques to two different types of input. First, we input time series of order parameters traditionally used in studies of collective motion. Second, we input measures based on topology that summarize the time-varying persistent homology of simulation data over multiple scales. This topological approach does not require prior knowledge of the expected patterns. For both unsupervised and supervised machine learning methods, the topological approach outperforms the one that is based on traditional order parameters.
- PNASMotility-limited aggregation of mammary epithelial cells into fractal-like clustersSusan E. Leggett, Zachary J. Neronha, Dhananjay Bhaskar, Jea Yun Sim, Theodora Myrto Perdikari, and Ian Y. WongProceedings of the National Academy of Sciences of the United States of America, Aug 2019
Migratory cells transition between dispersed individuals and multicellular collectives during development, wound healing, and cancer. These transitions are associated with coordinated behaviors as well as arrested motility at high cell densities, but remain poorly understood at lower cell densities. Here, we show that dispersed mammary epithelial cells organize into arrested, fractal-like clusters at low density in reduced epidermal growth factor (EGF). These clusters exhibit a branched architecture with a fractal dimension of [Formula: see text], reminiscent of diffusion-limited aggregation of nonliving colloidal particles. First, cells display diminished motility in reduced EGF, which permits irreversible adhesion upon cell-cell contact. Subsequently, leader cells emerge that guide collectively migrating strands and connect clusters into space-filling networks. Thus, this living system exhibits gelation-like arrest at low cell densities, analogous to the glass-like arrest of epithelial monolayers at high cell densities. We quantitatively capture these behaviors with a jamming-like phase diagram based on local cell density and EGF. These individual to collective transitions represent an intriguing link between living and nonliving systems, with potential relevance for epithelial morphogenesis into branched architectures.
- ACSBreast Cancer Cells Transition from Mesenchymal to Amoeboid Migration in Tunable Three-Dimensional Silk-Collagen HydrogelsAmanda S. Khoo, Thomas M. Valentin, Susan E. Leggett, Dhananjay Bhaskar, Elisa M. Bye, Shoham Benmelech, Blanche C. Ip, and Ian Y. WongACS biomaterials science & engineering, Sep 2019
Invading cancer cells adapt their migration phenotype in response to mechanical and biochemical cues from the extracellular matrix. For instance, mesenchymal migration is associated with strong cell-matrix adhesions and an elongated morphology, while amoeboid migration is associated with minimal cell-matrix adhesions and a rounded morphology. However, it remains challenging to elucidate the role of matrix mechan-ics and biochemistry, since these are both dependent on ECM protein concentration. Here, we demonstrate a composite silk fibroin and collagen I hydrogel where stiffness and microstructure can be systematically tuned over a wide range. Using an overlay assay geometry, we show that the invasion of metastatic breast cancer cells exhibits a biphasic dependence on silk fibroin concentration at fixed collagen I concentration, first increasing as the hydrogel stiffness increases, then decreasing as the pore size of silk fibroin decreases. Indeed, mesenchymal morphology exhibits a similar biphasic depen-dence on silk fibroin concentration, while amoeboid morphologies were favored when cell-matrix adhesions were less effective. We used exogenous biochemical treatment to perturb cells towards increased contractility and a mesenchymal morphology, as well as to disrupt cytoskeletal function and promote an amoeboid morphology. Overall, we envision that this tunable biomaterial platform in a 96-well plate format will be widely applicable to screen cancer cell migration against combinations of designer biomaterials and targeted inhibitors.
- RSC3D printed self-adhesive PEGDA–PAA hydrogels as modular components for soft actuators and microfluidicsThomas M. Valentin, Eric M. DuBois, Catherine E. Machnicki, Dhananjay Bhaskar, Francis R. Cui, and Ian Y. WongPolymer Chemistry, Apr 2019Publisher: The Royal Society of Chemistry
Hydrogel building blocks that are stimuli-responsive and self-adhesive could be utilized as a simple “do-it-yourself” construction set for soft machines and microfluidic devices. However, conventional covalently-crosslinked hydrogels are unsuitable since they are as static materials with poor interfacial adhesion. In this article, we demonstrate ion-responsive interchangeable parts based on composite hydrogels that incorporate both covalent and ionic crosslinking. We use light-directed 3D printing to covalently-crosslink poly(ethylene glycol) diacrylate in the presence of anionic poly(acrylic acid) of much higher molecular weight. The addition of trivalent cations acts to crosslink the anionic polymer chains together. Using high cation concentrations drives strong crosslinking, which can result in dramatic hydrogel contraction. Mismatched contraction of layered ion-responsive and non-ion-responsive hydrogels can control bending and twisting actuation, which is utilized for a gripping device. Alternatively, moderate cation concentrations permit strong self-adhesion between hydrogel surfaces. LEGO-like hydrogel blocks with internal channels and external mechanical connectors can be stacked into complex microfluidic device geometries including serpentine micromixers and multilevel architectures. This approach enables “plug-and-play” hydrogel parts for ionic soft machines that mimic actuation, sensing, and fluid transport in living systems.
2018
- Phys. Biol.Coupling mechanical tension and GTPase signaling to generate cell and tissue dynamicsCole Zmurchok, Dhananjay Bhaskar, and Leah Edelstein-KeshetPhysical Biology, Apr 2018
Regulators of the actin cytoskeleton such Rho GTPases can modulate forces developed in cells by promoting actomyosin contraction. At the same time, through mechanosensing, tension is known to affect the activity of Rho GTPases. What happens when these effects act in concert? Using a minimal model (1 GTPase coupled to a Kelvin–Voigt element), we show that two-way feedback between signaling (‘RhoA’) and mechanical tension (stretching) leads to a spectrum of cell behaviors, including contracted or relaxed cells, and cells that oscillate between these extremes. When such ‘model cells’ are connected to one another in a row or in a 2D sheet (‘epithelium’), we observe waves of contraction/relaxation and GTPase activity sweeping through the tissue. The minimal model lends itself to full bifurcation analysis, and suggests a mechanism that explains behavior observed in the context of development and collective cell behavior.
2017
- PLOSPolarization and migration in the zebrafish posterior lateral line systemHildur Knutsdottir, Cole Zmurchok, Dhananjay Bhaskar, Eirikur Palsson, Damian Dalle Nogare, Ajay B. Chitnis, and Leah Edelstein-KeshetPLOS Computational Biology, Apr 2017Publisher: Public Library of Science
Collective cell migration plays an important role in development. Here, we study the posterior lateral line primordium (PLLP) a group of about 100 cells, destined to form sensory structures, that migrates from head to tail in the zebrafish embryo. We model mutually inhibitory FGF-Wnt signalling network in the PLLP and link tissue subdivision (Wnt receptor and FGF receptor activity domains) to receptor-ligand parameters. We then use a 3D cell-based simulation with realistic cell-cell adhesion, interaction forces, and chemotaxis. Our model is able to reproduce experimentally observed motility with leading cells migrating up a gradient of CXCL12a, and trailing (FGF receptor active) cells moving actively by chemotaxis towards FGF ligand secreted by the leading cells. The 3D simulation framework, combined with experiments, allows an investigation of the role of cell division, chemotaxis, adhesion, and other parameters on the shape and speed of the PLLP. The 3D model demonstrates reasonable behaviour of control as well as mutant phenotypes.
Peer-Reviewed CS Conferences
2024
- LoGInferring Dynamic Regulatory Interaction Graphs From Time Series Data With PerturbationsDhananjay Bhaskar, Daniel Sumner Magruder, Matheo Morales, Edward De Brouwer, Aarthi Venkat, Frederik Wenkel, James Noonan, Guy Wolf, Natalia Ivanova, and Smita KrishnaswamyIn Proceedings of the Second Learning on Graphs Conference, Apr 2024ISSN: 2640-3498
Complex systems are characterized by intricate interactions between entities that evolve dynamically over time. Accurate inference of these dynamic relationships is crucial for understanding and predicting system behavior. In this paper, we propose Regulatory Temporal Interaction Network Inference (RiTINI) for inferring time-varying interaction graphs in complex systems using a novel combination of space-and-time graph attentions and graph neural ordinary differential equations (ODEs). RiTINI leverages time-lapse signals on a graph prior, as well as perturbations of signals at various nodes in order to effectively capture the dynamics of the underlying system. This approach is distinct from traditional causal inference networks, which are limited to inferring acyclic and static graphs. In contrast, RiTINI can infer cyclic, directed, and time-varying graphs, providing a more comprehensive and accurate representation of complex systems. The graph attention mechanism in RiTINI allows the model to adaptively focus on the most relevant interactions in time and space, while the graph neural ODEs enable continuous-time modeling of the system’s dynamics. We evaluate RiTINI’s performance on simulations of dynamical systems, neuronal networks, and gene regulatory networks, demonstrating its state-of-the-art capability in inferring interaction graphs compared to previous methods.
2023
- IEEE MLSPA Flow Artist for High-Dimensional Cellular DataKincaid MacDonald, Dhananjay Bhaskar, Guy Thampakkul, Nhi Nguyen, Joia Zhang, Michael Perlmutter, Ian Adelstein, and Smita KrishnaswamyIn 2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP), Apr 2023
We consider the problem of embedding point cloud data sampled from an underlying manifold with an associated flow or velocity. Such data arises in many contexts where static snapshots of dynamic entities are measured, including in high-throughput biology such as single-cell transcriptomics. Existing embedding techniques either do not utilize velocity information or embed the coordinates and velocities independently, i.e., they either impose velocities on top of an existing point embedding or embed points within a prescribed vector field. Here we present FlowArtist, a neural network that embeds points while jointly learning a vector field around the points. The combination allows FlowArtist to better separate and visualize velocity-informed structures. Our results, on toy datasets and single-cell RNA velocity data, illustrate the value of utilizing coordinate and velocity information in tandem for embedding and visualizing high-dimensional data.
2022
- IEEE MLSPMolecular Graph Generation via Geometric ScatteringDhananjay Bhaskar, Jackson Grady, Egbert Castro, Michael Perlmutter, and Smita KrishnaswamyIn 2022 IEEE 32nd International Workshop on Machine Learning for Signal Processing (MLSP), Aug 2022ISSN: 2161-0371
Although existing deep learning models perform remarkably well at predicting physicochemical properties and binding affinities, the generation of new molecules with optimized properties remains challenging. Inherently, most GNNs perform poorly in whole-graph representation due to the limitations of the message-passing paradigm. Furthermore, step-by-step graph generation frameworks that use reinforcement learning or other sequential processing can be slow and result in a high proportion of invalid molecules with substantial post-processing needed in order to generate valid molecules. To address these issues, we propose a representation-first approach to molecular graph generation. We guide the latent representation of an autoencoder by capturing graph structure information with the geometric scattering transform and apply penalties that organize the representation by molecular properties. We show that this highly structured latent space can be directly used for molecular graph generation by the use of a GAN. We demonstrate that our architecture learns meaningful representations of drug datasets and provides a platform for drug synthesis using publicly available ZINC and BindingDB datasets.
- NeurIPSDiffusion Curvature for Estimating Local Curvature in High Dimensional DataDhananjay Bhaskar, Kincaid MacDonald, Oluwadamilola Fasina, Dawson Thomas, Bastian Rieck, Ian Adelstein, and Smita KrishnaswamyAdvances in Neural Information Processing Systems, Dec 2022
Reviews
2023
- ReviewMultiscale geometric and topological analyses for characterizing and predicting immune responses from single cell dataAarthi Venkat, Dhananjay Bhaskar, and Smita KrishnaswamyTrends in Immunology, Jul 2023Publisher: Elsevier
2022
- ReviewCurrent trends in artificial intelligence in reproductive endocrinologyDhananjay Bhaskar, T. Arthur Chang, and Shunping WangCurrent Opinion in Obstetrics & Gynecology, Aug 2022
In this review, we discuss various applications of artificial intelligence in different areas of reproductive medicine. We summarize the current findings with their potentials and limitations, and also discuss the future direction for research and clinical applications.
- ReviewThe need for speed: Migratory cells in tight spaces boost their molecular clockDhananjay Bhaskar, Alex M. Hruska, and Ian Y. WongCell Systems, Jul 2022
Cells migrating in spatial confinement exhibit higher intracellular calcium levels, which increases the oscillation frequency of a “molecular clock” that synchronizes guanine nucleotide exchange factor GEF-H1 and microtubule polymerization for more frequent bursts of speed.
Preprints
2024
- arXivProtSCAPE: Mapping the landscape of protein conformations in molecular dynamicsSiddharth Viswanath, Dhananjay Bhaskar, David R. Johnson, Joao Felipe Rocha, Egbert Castro, Jackson D. Grady, Alex T. Grigas, Michael A. Perlmutter, Corey S. O’Hern, and Smita KrishnaswamyOct 2024arXiv:2410.20317
Understanding the dynamic nature of protein structures is essential for comprehending their biological functions. While significant progress has been made in predicting static folded structures, modeling protein motions on microsecond to millisecond scales remains challenging. To address these challenges, we introduce a novel deep learning architecture, Protein Transformer with Scattering, Attention, and Positional Embedding (ProtSCAPE), which leverages the geometric scattering transform alongside transformer-based attention mechanisms to capture protein dynamics from molecular dynamics (MD) simulations. ProtSCAPE utilizes the multi-scale nature of the geometric scattering transform to extract features from protein structures conceptualized as graphs and integrates these features with dual attention structures that focus on residues and amino acid signals, generating latent representations of protein trajectories. Furthermore, ProtSCAPE incorporates a regression head to enforce temporally coherent latent representations.
- bioRxivNeuro-GSTH: A Geometric Scattering and Persistent Homology Framework for Uncovering Spatiotemporal Signatures in Neural ActivityDhananjay Bhaskar, Jessica Moore, Yanlei Zhang, Feng Gao, Bastian Rieck, Guy Wolf, Helen Pushkarskaya, Firas Khasawneh, Elizabeth Munch, Valentina Greco, Christopher Pittenger, and Smita KrishnaswamyOct 2024
Understanding how neurons communicate and coordinate their activity is essential for unraveling the brain’s complex functionality. To analyze the intricate spatiotemporal dynamics of neural signaling, we developed Geometric Scattering Trajectory Homology (neuro-GSTH), a novel framework that captures time-evolving neural signals and encodes them into low-dimensional representations. GSTH integrates geometric scattering transforms, which extract multiscale features from brain signals modeled on anatomical graphs, with t-PHATE, a manifold learning method that maps the temporal evolution of neural activity. Topological descriptors from computational homology are then applied to characterize the global structure of these neural trajectories, enabling the quantification and differentiation of spatiotemporal brain dynamics. We demonstrate the power of neuro-GSTH in neuroscience by applying it to both simulated and biological neural datasets. First, we used neuro-GSTH to analyze neural oscillatory behavior in the Kuramoto model, revealing its capacity to track the synchronization of neural circuits as coupling strength increases. Next, we applied neuro-GSTH to neural recordings from the visual cortex of mice, where it accurately reconstructed visual stimulus patterns such as sinusoidal gratings. Neuro-GSTH-derived neural trajectories enabled precise classification of stimulus properties like spatial frequency and orientation, significantly outperforming traditional methods in capturing the underlying neural dynamics. These findings demonstrate that neuro-GSTH effectively identifies neural motifs—distinct patterns of spatiotemporal activity—providing a powerful tool for decoding brain activity across diverse tasks, sensory inputs, and neurological disorders. Neuro-GSTH thus offers new insights into neural communication and dynamics, advancing our ability to map and understand complex brain functions.
- bioRxivGenerative modeling of biological shapes and images using a probabilistic α-shape samplerEmily T. Winn-Nuñez, Hadley Witt, Dhananjay Bhaskar, Ryan Y. Huang, Jonathan S. Reichner, Ian Y. Wong, and Lorin CrawfordJan 2024
Understanding morphological variation is an important task in many areas of computational biology. Recent studies have focused on developing computational tools for the task of sub-image selection which aims at identifying structural features that best describe the variation between classes of shapes. A major part in assessing the utility of these approaches is to demonstrate their performance on both simulated and real datasets. However, when creating a model for shape statistics, real data can be difficult to access and the sample sizes for these data are often small due to them being expensive to collect. Meanwhile, the current landscape of generative models for shapes has been mostly limited to approaches that use black-box inference—making it difficult to systematically assess the power and calibration of sub-image models. In this paper, we introduce the α-shape sampler: a probabilistic framework for generating realistic 2D and 3D shapes based on probability distributions which can be learned from real data. We demonstrate our framework using proof-of-concept examples and in two real applications in biology where we generate (i) 2D images of healthy and septic neutrophils and (ii) 3D computed tomography (CT) scans of primate mandibular molars. The α-shape sampler R package is open-source and can be downloaded at https://github.com/lcrawlab/ashapesampler., Using shapes and images to understand genotypic and phenotypic variation has proven to be an effective strategy in many biological applications. Unfortunately, shape data can be expensive to collect and, as a result, sample sizes for analyses are often small. Despite methodological advancements in shape statistics and machine learning, benchmarking standards for evaluating new computational tools via data simulation is still underdeveloped. In this paper, we present a probability-based pipeline called the α-shape sampler which has the flexibility to generate new and unobserved shapes based on an input set of data. We extensively evaluate the generative capabilities of our pipeline using 2D cellular images of neutrophils and 3D mandibular molars from two different suborders of primates.
- bioRxivNeuroSCAN: Exploring Neurodevelopment via Spatiotemporal Collation of Anatomical NetworksNoelle L. Koonce, Sarah E. Emerson, Dhananjay Bhaskar, Manik Kuchroo, Mark W. Moyle, Pura Arroyo-Morales, Nabor Vázquez Martínez, Smita Krishnaswamy, William Mohler, and Daniel Colón-RamosAug 2024
Volume electron microscopy (vEM) datasets such as those generated for connectome studies allow nanoscale quantifications and comparisons of the cell biological features underpinning circuit architectures. Quantifications of cell biological relationships in the connectome result in rich multidimensional datasets that benefit from data science approaches, including dimensionality reduction and integrated graphical representations of neuronal relationships. We developed NeuroSCAN, an online open- source platform that bridges sophisticated graph analytics from data science approaches with the underlying cell biological features in the connectome. We apply NeuroSCAN to a complete published record of C. elegans brain neuropils and demonstrate how these integrated representations of neuronal relationships facilitate comparisons across connectomes, catalyzing new insights on the structure-function of the circuits and their changes during development. NeuroSCAN is designed for intuitive examination and comparisons across connectomes, enabling synthesis of knowledge from high-level abstractions of neuronal relationships derived from data science techniques to the detailed identification of the cell biological features underpinning these abstractions.
- arXivLatent Representation Learning for Multimodal Brain Activity TranslationArman Afrasiyabi, Dhananjay Bhaskar, Erica L. Busch, Laurent Caplette, Rahul Singh, Guillaume Lajoie, Nicholas B. Turk-Browne, and Smita KrishnaswamySep 2024arXiv:2409.18462 [cs, q-bio]
Neuroscience employs diverse neuroimaging techniques, each offering distinct insights into brain activity, from electrophysiological recordings such as EEG, which have high temporal resolution, to hemodynamic modalities such as fMRI, which have increased spatial precision. However, integrating these heterogeneous data sources remains a challenge, which limits a comprehensive understanding of brain function. We present the Spatiotemporal Alignment of Multimodal Brain Activity (SAMBA) framework, which bridges the spatial and temporal resolution gaps across modalities by learning a unified latent space free of modality-specific biases. SAMBA introduces a novel attention-based wavelet decomposition for spectral filtering of electrophysiological recordings, graph attention networks to model functional connectivity between functional brain units, and recurrent layers to capture temporal autocorrelations in brain signal. We show that the training of SAMBA, aside from achieving translation, also learns a rich representation of brain information processing. We showcase this classify external stimuli driving brain activity from the representation learned in hidden layers of SAMBA, paving the way for broad downstream applications in neuroscience research and clinical contexts.
- arXivLooking through the mind’s eye via multimodal encoder-decoder networksArman Afrasiyabi, Erica Busch, Rahul Singh, Dhananjay Bhaskar, Laurent Caplette, Nicholas Turk-Browne, and Smita KrishnaswamySep 2024arXiv:2410.00047 [cs, eess, q-bio]
In this work, we explore the decoding of mental imagery from subjects using their fMRI measurements. In order to achieve this decoding, we first created a mapping between a subject’s fMRI signals elicited by the videos the subjects watched. This mapping associates the high dimensional fMRI activation states with visual imagery. Next, we prompted the subjects textually, primarily with emotion labels which had no direct reference to visual objects. Then to decode visual imagery that may have been in a person’s mind’s eye, we align a latent representation of these fMRI measurements with a corresponding video-fMRI based on textual labels given to the videos themselves. This alignment has the effect of overlapping the video fMRI embedding with the text-prompted fMRI embedding, thus allowing us to use our fMRI-to-video mapping to decode. Additionally, we enhance an existing fMRI dataset, initially consisting of data from five subjects, by including recordings from three more subjects gathered by our team. We demonstrate the efficacy of our model on this augmented dataset both in accurately creating a mapping, as well as in plausibly decoding mental imagery.
2023
- arXivGraph topological property recovery with heat and wave dynamics-based features on graphsDhananjay Bhaskar, Yanlei Zhang, Charles Xu, Xingzhi Sun, Oluwadamilola Fasina, Guy Wolf, Maximilian Nickel, Michael Perlmutter, and Smita KrishnaswamySep 2023arXiv:2309.09924 [cs, eess, stat]
In this paper, we propose Graph Differential Equation Network (GDeNet), an approach that harnesses the expressive power of solutions to PDEs on a graph to obtain continuous node- and graph-level representations for various downstream tasks. We derive theoretical results connecting the dynamics of heat and wave equations to the spectral properties of the graph and to the behavior of continuous-time random walks on graphs. We demonstrate experimentally that these dynamics are able to capture salient aspects of graph geometry and topology by recovering generating parameters of random graphs, Ricci curvature, and persistent homology. Furthermore, we demonstrate the superior performance of GDeNet on real-world datasets including citation graphs, drug-like molecules, and proteins.