World Modeling: Evaluation and State Computation

Yoav Artzi (Cornell University) https://simons.berkeley.edu/talks/yoa... Topics in Intelligence: World Models and Social Reasoning This talk briefly describes three projects. The first two focus on evaluating world modeling capabilities in frontier models through the evaluation of dual perspective reasoning and knot manipulation. The third part describes the state separation hypothesis, which posits that mechanically separating prediction from state computation in LLMs benefits performance.