Lec 08. Architectures: Transformers

MIT 6.7960 Deep Learning, Fall 2024 Instructor: Phillip Isola View the complete course: https://ocw.mit.edu/courses/6-7960-de... YouTube Playlist:    • MIT 6.7960 Deep Learning, Fall 2024   This video introduces transformers, focusing on three key ideas: tokens, attention, and positional codes. It also explores how transformers relate to MLPs, GNNs, and CNNs as variations on common principles. License: Creative Commons BY-NC-SA More information at https://ocw.mit.edu/terms More courses at https://ocw.mit.edu Support OCW at http://ow.ly/a1If50zVRlQ We encourage constructive comments and discussion on OCW’s YouTube and other social media channels. Personal attacks, hate speech, trolling, and inappropriate comments are not allowed and may be removed. More details at https://ocw.mit.edu/comments.