Large Models for Single-Cell Omics and Drug Discovery: Data, Pretraining and Closed-Loop Environment
This seminar was delivered on 8 September 2025. For more information: https://spds.sydney.edu.au/ Speaker: Dr Haotian Cui, University of Toronto Abstract: This presentation will discuss recent advances in foundation models for single-cell omics and therapeutic discovery, focusing on the development of scGPT, a generative transformer model trained on over 33 million single-cell profiles. It will cover key design principles—such as generative pretraining and multi-task alignment—that enable broad applications including cell type annotation, perturbation response prediction, and reference mapping. The talk will also highlight emerging multimodal extensions such as scGPT-spatial and MethylGPT, and emphasize the importance of perturbational training for modeling cellular dynamics. Finally, it will introduce LUMI-lab, a closed-loop self-driving platform developed by Dr. Cui and collaborators, which integrates model-guided design, synthesis, and validation to accelerate mRNA delivery—demonstrating a scalable framework for virtual cell modeling and active learning in biology. About the speaker: Dr Haotian Cui is a researcher specializing in machine learning, genomics, and drug discovery. He recently completed his Ph.D. in Computer Science at the University of Toronto, advised by Prof. Bo Wang. His research focuses on developing large-scale self-supervised and generative foundation models for single-cell omics and molecular biology, integrating AI with experimental pipelines to accelerate therapeutic discovery. He is the lead author of scGPT—one of the first generative foundation models for single-cell multi-omics. He also led the development of LUMI-lab, an autonomous AI-driven platform for mRNA therapeutics. He has published numerous papers in leading journals and conferences, including Nature, Nature Methods, Nature Communications, ACL, EMNLP, ICML.

Next generation CRISPR screens based on single-cell transcriptomics | Tilmann Buerckstuemmer

Quantum Consciousness and the Origin of Life

AlphaFold - The Most Useful Thing AI Has Ever Done

Characterizing cell-type spatial relationships across length scales in spatially resolved omics data

Towards multimodal foundation models in molecular cell biology | Haotian Cui
![Yann LeCun's $1B Bet Against LLMs [Part 1]](https://i.ytimg.com/vi/kYkIdXwW2AE/hqdefault.jpg?sqp=-oaymwEjCNACELwBSFryq4qpAxUIARUAAAAAGAElAADIQj0AgKJDeAE=&rs=AOn4CLDbV4izF3i-wxevCVIn7FJjoy1vlA)
Yann LeCun's $1B Bet Against LLMs [Part 1]

How AI Cracked the Protein Folding Code and Won a Nobel Prize

How Are We So Good at Folding Proteins? - With David Balchin

How ASML Makes Chips Faster With Its New $400 Million High NA Machine

How Proctor’s texts in Karen Read lawsuit could free dangerous criminals

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

This is not the AI we were promised | The Royal Society

The Successor to CRISPR May Be Even More World Changing

The World's Most Important Machine

Yann LeCun: World Models: Enabling the next AI revolution

A Systematic Comparison of Single-Cell Perturbation Response Prediction Models

“Curing All Disease by next century is too conservative" - Mark Zuckerberg

Andrew Ng: Building Faster with AI

Nobel Prize lecture: Demis Hassabis, Nobel Prize in Chemistry 2024

