Computational Molecular Medicine

Single-cell multi-omics data present unique challenges: extreme sparsity and technical noise from dropouts, high dimensionality with thousands of features per cell, and pervasive batch and donor effects that can obscure true biological variation. Robust normalization, dimensionality reduction, and noise-modeling approaches are therefore essential to recover reliable signals and ensure that downstream analyses reflect genuine cellular heterogeneity rather than artifacts of the technology. Our lab combines advanced computational methods with high-dimensional biological data to uncover mechanisms of disease, with a particular emphasis on cancer and metabolic disorders. By integrating single-cell multi-omics, and in vivo model data, we aim to build a mechanistic and spatially resolved understanding of disease progression and therapeutic response.

Single-Cell Multi-Omics in Clinical Cohorts

We specialize in the analysis of clinical single-cell multi-omics datasets, particularly from cancer and metabolic disease cohorts. Using state-of-the-art machine learning techniques, we analyze single-cell RNA sequencing and chromatin accessibility data to characterize cellular heterogeneity and regulatory dynamics. Our pipeline includes robust dimensionality reduction, clustering, and batch correction methods, allowing us to identify distinct cell populations and states across individuals. Through probabilistic graphical modeling and motif enrichment analysis, we reconstruct gene regulatory networks that govern disease-specific transcriptional programs. These approaches allow us to overcome the sparsity and noise inherent to single-cell data and extract biologically meaningful patterns that inform prognosis and therapeutic strategy.

Multi-Omics in Mouse Models of Cancer

To explore tumor development and treatment effects in vivo, we analyze multi-omics data generated from mouse models, including xenografts and genetically induced cancers. These datasets are complex, encompassing multiple axes of variation such as treatment regimens, time points, and tumor subtypes. To disentangle these factors, we develop tailored probabilistic latent variable models (LVMs) that reveal how sources of variability interact and which molecular features are relevant to human disease. This framework enables us to translate findings from experimental models into clinically actionable insights.

Adrien Jolly
Adrien Jolly
Postdoctoral Researcher
Ali Yavuz Çakır
Ali Yavuz Çakır
Research Scientist
Sarmad Ahmad Khan
Sarmad Ahmad Khan
Ph.D. Candidate

My research interest is in bioinformatics specifically focusing on the multi-omics integration and predictive models based on the omics data helping in oncological studies.

Yihao Liu
Yihao Liu
Ph.D. Candidate