AI는 무엇을 숨기고 있는가, '전두엽 절제술 AI 버전' | 희소 오토인코더

인간의 뇌를 모방해 만든 대형 언어모델의 뉴런들은 실제로 어떠한 개념을 담고 있을까요? 이번 영상에선, 블랙박스처럼만 여겨졌던 AI 모델들을 이해하려는 시도 중 각광받고 있는 스파스 오토인코더 (Sparse Autoencoder) 에 대해 살펴보겠습니다. Gemma walkthrough notebook: https://colab.research.google.com/dri... Most animations made with Manim: https://github.com/3b1b/manim References and Further Reading Chris Olah’s original “Dark Matter of Neural Networks” post: https://transformer-circuits.pub/2024... Great recent interview with Chris Olah: • Dario Amodei: Anthropic CEO on Claude, AGI... Gemma Scope: https://arxiv.org/pdf/2408.05147 Experiment with SAEs yourself here! https://www.neuronpedia.org/ Relevant work from the Anthropic team: https://transformer-circuits.pub/2022... https://transformer-circuits.pub/2023... https://transformer-circuits.pub/2024... Excellent intro Mechanistic Interpretability: https://arena3-chapter1-transformer-i... Neel Nanda’s Mechanistic Interpretability Explainer: https://dynalist.io/d/n2ZWtnoYHrU1s4v... Transformer Lens: https://github.com/TransformerLensOrg... SAE Lens: https://jbloomaus.github.io/SAELens/l... Technical Notes 1. There are more advanced and more meaningful ways to map mid layer vectors to outputs, see: https://arxiv.org/pdf/2303.08112, https://neuralblog.github.io/logit-pr..., https://www.lesswrong.com/posts/AcKRB... 2. The 6x2304 matrix is actually 7x2304, we’re ignoring the /bos token. 3. Gemma also includes positional embeddings and lots and lots of normalization layers, which we didn’t really cover 4. I’m conflating tokens and words sometimes, in this example each word is a token, so we don’t have to worry about it too much 5. The “_” characters represent spaces in the token strings Translated & Dubbed by Jaeseok Jeong, CineLingo

인류가 이해한 AI는 여기까지 입니다 | 알렉스넷

인류가 이해한 AI는 여기까지 입니다 | 알렉스넷

Why Is AI Still "Downhill"? | Gradient Descent and Local Minima

Why Is AI Still "Downhill"? | Gradient Descent and Local Minima

AI시대, 인간의 상호 작용이 중요한 이유는?

AI시대, 인간의 상호 작용이 중요한 이유는?

Don't make chat-g-p-t-i at home; buy it | Scaling Law

Don't make chat-g-p-t-i at home; buy it | Scaling Law

The Power of Quantum Computers: Solving a Problem That Would Take 10 Billion Years in Just 5 Minu...

The Power of Quantum Computers: Solving a Problem That Would Take 10 Billion Years in Just 5 Minu...

챗GPT가 당신을 은근히 무시하는 이유

챗GPT가 당신을 은근히 무시하는 이유

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

Deepseek's +99 Enhanced Transformer Club | KV Cache & Multihead Potential Attention

Deepseek's +99 Enhanced Transformer Club | KV Cache & Multihead Potential Attention

트랜스포머, ChatGPT가 트랜스포머로 만들어졌죠. - DL5

트랜스포머, ChatGPT가 트랜스포머로 만들어졌죠. - DL5

AI가 인류에게 미친 최고의 순기능 (대 AI 시대 + 과학 이건 못참지)

AI가 인류에게 미친 최고의 순기능 (대 AI 시대 + 과학 이건 못참지)

꼭 알아야할 안드레 카파시 30분 인터뷰 완전정리 - AI시대의 필수 인사이트!

꼭 알아야할 안드레 카파시 30분 인터뷰 완전정리 - AI시대의 필수 인사이트!

기업이 꼭 알아야 할 '온톨로지'의 모든 것 (김학래 중앙대 교수)

기업이 꼭 알아야 할 '온톨로지'의 모든 것 (김학래 중앙대 교수)

자연어 처리 트랜스포머 1강(Embedding, Positional Encoding)

자연어 처리 트랜스포머 1강(Embedding, Positional Encoding)

그 이름도 유명한 어텐션, 이 영상만 보면 이해 완료! - DL6

그 이름도 유명한 어텐션, 이 영상만 보면 이해 완료! - DL6

수많은 정보는 LLM 모델 속 어디에 저장되어있는걸까? | DL 7

수많은 정보는 LLM 모델 속 어디에 저장되어있는걸까? | DL 7

뉴럴네트워크라는걸 들어 보셨다면 보셔야 할 영상. - DL1

뉴럴네트워크라는걸 들어 보셨다면 보셔야 할 영상. - DL1

Without this, there would be no AI today | Backpropagation

Without this, there would be no AI today | Backpropagation

When these 10 billion dials come together, you get ChatGPT | Perceptron

When these 10 billion dials come together, you get ChatGPT | Perceptron

LLM 설명 (요약버전)

LLM 설명 (요약버전)

AI 헛소리는 버그 아니다... 할루시네이션 구조적 이유, 최신 기술, 꿀팁 | GPT, Gemini, Grok, Claude 비교

AI 헛소리는 버그 아니다... 할루시네이션 구조적 이유, 최신 기술, 꿀팁 | GPT, Gemini, Grok, Claude 비교