MIA: Shreya Johri, Evaluating AI agents in biological discovery; primer by Maha Shady

Models, Inference, and Algorithms | April 15, 2026 Broad Institute of MIT and Harvard Seminar: Evaluating the autonomous and copilot limitations of AI agents for biological discovery Shreya Johri Graduate Student, Harvard University Abstract: Recent advances in large language models (LLMs) have improved their ability to execute structured analytical workflows, including standard bioinformatic pipelines. However, computational biology rarely consists of deterministic pipeline execution alone. Biological datasets are heterogeneous and noisy, and meaningful discovery often requires open-ended hypothesis generation and iterative reasoning over multimodal evidence. The extent to which emerging agentic AI systems can support this mode of scientific discovery remains poorly characterized. Here, we systematically evaluate the capabilities and limitations of agentic AI for biological discovery using multimodal oncology datasets spanning 15 cancer types. We benchmark 10 analysis tasks designed to vary in biological reasoning complexity, including replication of canonical workflows, tumor-program characterization, tumor-microenvironment analysis, and immune-cell discovery tasks. We also benchmark autonomous and human-copilot agent configurations. Our results delineate the current boundaries of agentic AI in computational biology and provide a framework for evaluating AI systems designed to support scientific discovery. Primer: AI agents in biomedical research Maha Shady Graduate Student, Harvard University Abstract: LLM-based agents are increasingly used in biomedical research pipelines, for literature synthesis, data analysis, hypothesis generation, and clinical decision support. This talk provides an overview of how these systems work and where they break. We cover the core architectural components of single and multi-agent systems, as well as current evaluation benchmarks and failure mechanisms specific to biomedical applications. The talk may serve as a practical guide for developing and evaluating these systems, with consideration of failure modes most consequential in biomedical settings. About MIA: The Models, Inference & Algorithms (MIA) Initiative at the Broad Institute supports learning and collaboration across the interface of biology and medicine with mathematics, statistics, machine learning, and computer science. Our weekly meetings are open and pedagogical, emphasizing lucid exposition of computational ideas over rapid-fire communication of results. MIA is hosted by the Eric and Wendy Schmidt Center at the Broad Institute. Relevant Links: MIA Website: https://www.broadinstitute.org/mia MIA YouTube Playlist: https://broad.io/MIAPlaylist Copyright Broad Institute, 2026. All rights reserved.

CLEAR 2026: Keynote, Proxy Variables for Causal Effect Estimation with Hidden Confounding
▶︎

CLEAR 2026: Keynote, Proxy Variables for Causal Effect Estimation with Hidden Confounding

Training Sand to Think: Artificial General Intelligence & Future of Physics
▶︎

Training Sand to Think: Artificial General Intelligence & Future of Physics

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source
▶︎

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

Yann LeCun: World Models: Enabling the next AI revolution
▶︎

Yann LeCun: World Models: Enabling the next AI revolution

Python Variables | Python Operators | Python Tutorial For Beginners | Intellipaat
▶︎

Python Variables | Python Operators | Python Tutorial For Beginners | Intellipaat

Zig 2026: No-AI Policy, $670K Foundation, Left GitHub & Why Zig Isn’t 1.0 - Andrew Kelley Explains
▶︎

Zig 2026: No-AI Policy, $670K Foundation, Left GitHub & Why Zig Isn’t 1.0 - Andrew Kelley Explains

Trump Gets Booed and Falls Asleep at NBA Finals, Spreads Deranged CA Election Lies: A Closer Look
▶︎

Trump Gets Booed and Falls Asleep at NBA Finals, Spreads Deranged CA Election Lies: A Closer Look

Something is jamming GPS over Europe. Here's what we found
▶︎

Something is jamming GPS over Europe. Here's what we found

LIVE: Conan O’Brien speaks at Harvard graduation ceremony (full)
▶︎

LIVE: Conan O’Brien speaks at Harvard graduation ceremony (full)

Putin ready for anything. Follow the live broadcast with Alessandro Orsini
▶︎

Putin ready for anything. Follow the live broadcast with Alessandro Orsini

Microsoft Fabric and Power BI - Developer of the Future⚡ [Full Course]
▶︎

Microsoft Fabric and Power BI - Developer of the Future⚡ [Full Course]

AI Pioneer Geoffrey Hinton: AI Is Conscious, Superintelligence is Coming, And We Should Be Worried
▶︎

AI Pioneer Geoffrey Hinton: AI Is Conscious, Superintelligence is Coming, And We Should Be Worried

The French Do Not Care About Work
▶︎

The French Do Not Care About Work

Politics Chat, June 9, 2026
▶︎

Politics Chat, June 9, 2026

What is SonarQube | Introduction SonarQube | SonarQube Tutorial | SonarQube Basics | Intellipaat
▶︎

What is SonarQube | Introduction SonarQube | SonarQube Tutorial | SonarQube Basics | Intellipaat

Sarah Paine - Why Putin and Xi can't escape geography
▶︎

Sarah Paine - Why Putin and Xi can't escape geography

Exposing The Solid State Donut Battery. It's Over.
▶︎

Exposing The Solid State Donut Battery. It's Over.

ACLS Drugs Review with Nurse Eunice  📚💉
▶︎

ACLS Drugs Review with Nurse Eunice 📚💉

"A.I. and Our Economic Future," Professor Chad Jones
▶︎

"A.I. and Our Economic Future," Professor Chad Jones

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker
▶︎

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker