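Paper Sharing: Transformer Without Normalization
This video presents the paper Transformer Without Normalization, published by a Meta team at CVPR 2025. The team challenges the long-held assumption that normalization layers are indispensable: they replace the normalization layer with their proposed Dynamic Tanh and achieve comparable or better performance. What is most worth learning from is the rigor of the experiments, which give a good demonstration of how to move from observing a property, to validating a hypothesis, to understanding the method's limitations.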
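
To make the idea of swapping a normalization layer for Dynamic Tanh concrete, here is a minimal sketch in plain Python. It assumes the element-wise form DyT(x) = gamma * tanh(alpha * x) + beta, with a single learnable scalar `alpha` and per-feature `gamma` and `beta`. The function name `dynamic_tanh` and the default `alpha=0.5` are illustrative assumptions, not code taken from the video or the paper's repository.

```python
import math

def dynamic_tanh(x, alpha=0.5, gamma=None, beta=None):
    """Apply gamma * tanh(alpha * x) + beta element-wise to a feature vector.

    Assumptions for this sketch: alpha is one scalar shared by all features,
    gamma and beta are per-feature scale and shift (default 1 and 0).
    """
    n = len(x)
    gamma = [1.0] * n if gamma is None else gamma
    beta = [0.0] * n if beta is None else beta
    # Unlike LayerNorm, no mean or variance is computed over the features;
    # tanh squashes extreme activations while staying near-linear around zero.
    return [g * math.tanh(alpha * v) + b for v, g, b in zip(x, gamma, beta)]

if __name__ == "__main__":
    print(dynamic_tanh([-100.0, -0.01, 0.0, 0.01, 100.0]))
```

In this sketch, small inputs pass through almost linearly (scaled by `alpha`), while very large inputs saturate, which is the behavior the layer relies on in place of explicit activation statistics.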
