Transformer誕生物語｜Attention is All You Need

こちらの論文を解説します： Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems 30 (2017). https://proceedings.neurips.cc/paper/... 📄 スライド：https://speakerdeck.com/mathbullet/do... 🐧 Twitter：https://x.com/_mathbullet 🎥 エージェント入門： • 【入門】AIエージェントの深淵【図解】 👨🏻‍🎓 アジェンダ 00:00:25 今回の内容 00:01:48 なぜこの論文が重要なのか 00:03:45 人工知能の論文を読む切り口 00:06:41 系列モデリング 00:11:45 対抗馬は何か 00:19:57 主提案は何か 00:31:20 位置符号化 00:37:20 マルチヘッド自己注意 00:56:12 Point-wise Feed Forward Network 00:59:57 最終出力 01:06:23 評価のWhat/Result 01:13:32 その後の展開 --- 「数理の弾丸」は、人工知能や言語にまつわる専門知をわかりやすく、誤魔化さずに伝えることを目指すチャンネルです。 ■スピーカー：吉田、スミス #transformer #llm #attention #chatgpt

Llama 3.1: The technology behind the latest and greatest language models

Llama 3.1: The technology behind the latest and greatest language models

甘利俊一「人工知能と数理脳科学」－2024年ノーベル物理学賞に関する特別講演

甘利俊一「人工知能と数理脳科学」－2024年ノーベル物理学賞に関する特別講演

AI Is Creating A Rare Opportunity For Investors. How Jim Roppel Is Playing It. | Investing With IBD

AI Is Creating A Rare Opportunity For Investors. How Jim Roppel Is Playing It. | Investing With IBD

Yann LeCun: World Models: Enabling the next AI revolution

Yann LeCun: World Models: Enabling the next AI revolution

We Might Be Wrong About Black Holes

We Might Be Wrong About Black Holes

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

Visualizing transformers and attention | Talk for TNG Big Tech Day '24

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

RL for Agents Workshop - Deep Dive on Training Agents with RL and Open Source

Pensions: What’s being snuck past us (yet again) during the World Cup

Pensions: What’s being snuck past us (yet again) during the World Cup

AlphaFold - The Most Useful Thing AI Has Ever Done

AlphaFold - The Most Useful Thing AI Has Ever Done

【深層学習】Transformer - Multi-Head Attentionを理解してやろうじゃないの【ディープラーニングの世界vol.28】#106 #VRアカデミア #DeepLearning

【深層学習】Transformer - Multi-Head Attentionを理解してやろうじゃないの【ディープラーニングの世界vol.28】#106 #VRアカデミア #DeepLearning

Training Sand to Think: Artificial General Intelligence & Future of Physics

Training Sand to Think: Artificial General Intelligence & Future of Physics

What rebuilding AlphaGo teaches us about self-play, RL, and future of LLMs - Eric Jang

What rebuilding AlphaGo teaches us about self-play, RL, and future of LLMs - Eric Jang

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

Turing Award Winner: Disagreeing with Google, Postgres, Future Problems | Mike Stonebraker

"Emotionale KI: Was bedeutet sie für unser Menschsein?" Vortrag vom Philosophen Prof. Markus Gabriel

"Emotionale KI: Was bedeutet sie für unser Menschsein?" Vortrag vom Philosophen Prof. Markus Gabriel

「生成AI」(3) 松尾豊・東京大学大学院教授　2024.3.15

「生成AI」(3) 松尾豊・東京大学大学院教授　2024.3.15

【数えるだけ】AIが単語を理解するトリックが巧妙すぎる【大規模言語モデル2】#130

【数えるだけ】AIが単語を理解するトリックが巧妙すぎる【大規模言語モデル2】#130

Agent Skills の仕組みと実践・最前線【コンテキストエンジニアリング】

Agent Skills の仕組みと実践・最前線【コンテキストエンジニアリング】

【論文解読】ハーネスエンジニアリングの自動化【Meta-Harness】

【論文解読】ハーネスエンジニアリングの自動化【Meta-Harness】

But what is quantum computing? (Grover's Algorithm)

But what is quantum computing? (Grover's Algorithm)

Yann LeCun's $1B Bet Against LLMs [Part 1]

Yann LeCun's $1B Bet Against LLMs [Part 1]