Jimmy Ba | How to steer foundation models?
New Technologies in Mathematics Seminar 3/9/2023
Speaker: Jimmy Ba, University of Toronto
Title: How to steer foundation models?
Abstract: By conditioning on natural language instructions, foundation models and large language models (LLMs) have displayed impressive capabilities as general-purpose computers. However, task performance depends significantly on the quality of the prompt used to steer the model. Due to the lack of knowledge of how foundation models work, most effective prompts have been handcrafted by humans through a demanding trial-and-error process. To reduce the human effort in this alignment process, I will discuss a few approaches to steer these powerful models to excel in various downstream language and image tasks.
