Auto-Formalization for Trustworthy Planning

Despite the rapid advancement of AI, most systems in high-stakes applications remain primarily limited to rule-based interactions and cannot reliably plan or execute complex user tasks. Despite recent efforts in using large language models (LLMs) to plan as agents, their hallucinations and lack of verifiability undermine executability and trust, preventing real-world deployment. This proposal advances an alternative paradigm: LLM-as-formalizer. Instead of relying on LLMs to generate plans directly, we use them as a code generator to translate a user’s environment and goal into formal languages (such as PDDL) that can be deterministically solved by off-the-shelf solvers. This neurosymbolic approach combines the flexibility of LLMs with the reliability of symbolic systems, offering a pathway toward trustworthy, generalizable planning. In this talk, I will discuss a few advances in 2025 including a comprehensive evaluation of LLM's auto-formalization ability under a unified methodological framework, and also ongoing work on iterative and multi-agent planning in partially observable environments. Li "Harry" Zhang is an assistant professor at Drexel University, focusing on Natural Language Processing (NLP) and artificial intelligence (AI). He obtained his PhD degree from the University of Pennsylvania in 2024, advised by Prof. Chris Callison-Burch and chaired by Prof. Dan Roth. He was a year-long intern in 2023 at the Allen Institute for Artificial Intelligence. He obtained his Bachelor's degree from the University of Michigan in 2018, mentored by Prof. Rada Mihalcea and Prof. Dragomir Radev. His research agenda use large language models (LLMs) as auto-formalizers for trustworthy problem-solving, accepted to the AAAI 2026 New Faculty Highlights program. He has published more than 30 peer-reviewed papers in NLP and AI conferences, such as ACL, EMNLP, and NAACL, that have been cited more than 3,000 times. He also consistently serves as Area Chair, Session Chair, and reviewer in those venues. Outside academia, he is a sponsored musician, producer, and content creator having over 60,000 subscribers across streaming platforms.

Don't learn AI Agents without Learning these Fundamentals

Don't learn AI Agents without Learning these Fundamentals

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Andrej Karpathy: From Vibe Coding to Agentic Engineering w/ Stephanie Zhan

Mapping the Data Center Industry: Who Benefits, Who Calls the Shots, and What to Do About It

Mapping the Data Center Industry: Who Benefits, Who Calls the Shots, and What to Do About It

Software engineering at the tipping point

Software engineering at the tipping point

How To Think SO CLEARLY People Assume You're A Genius

How To Think SO CLEARLY People Assume You're A Genius

Webinar | Introduction to parallel performance engineering

Webinar | Introduction to parallel performance engineering

How AI agents & Claude skills work (Clearly Explained)

How AI agents & Claude skills work (Clearly Explained)

The Strange Math That Predicts (Almost) Anything

The Strange Math That Predicts (Almost) Anything

Systems Thinking for Leaders: Designing Solutions That Work

Systems Thinking for Leaders: Designing Solutions That Work

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Understanding Equilibria in Multi-Agent Systems - Michael Wooldridge, University of Oxford

Understanding Equilibria in Multi-Agent Systems - Michael Wooldridge, University of Oxford

Scott and Mark learn...how agents reshape software engineering | BRK247

Scott and Mark learn...how agents reshape software engineering | BRK247

Why birth rates are falling everywhere all at once | FT

Why birth rates are falling everywhere all at once | FT

Demis Hassabis: Agents, AGI & The Next Big Scientific Breakthrough

Demis Hassabis: Agents, AGI & The Next Big Scientific Breakthrough

Yann LeCun: World Models: Enabling the next AI revolution

Yann LeCun: World Models: Enabling the next AI revolution

From LLMs to Agents: Generalizability from the Inside Out

From LLMs to Agents: Generalizability from the Inside Out

Something is jamming GPS over Europe. Here's what we found

Something is jamming GPS over Europe. Here's what we found

FULL DISCUSSION: Google's Demis Hassabis, Anthropic's Dario Amodei Debate the World After AGI | AI1G

FULL DISCUSSION: Google's Demis Hassabis, Anthropic's Dario Amodei Debate the World After AGI | AI1G

Movement Primitives as Action Sequence Models for Efficient Robot Learning

Movement Primitives as Action Sequence Models for Efficient Robot Learning

From Identification to Accountability: the Evolving Practice of Attribution

From Identification to Accountability: the Evolving Practice of Attribution