Deep Learning: Transformers

In this class we look at the "Transformer" architecture for processing sequences, which is based almost entirely on the concept of neural attention. The Transformer is the basis of modern NLP architectures, in particular GPT and BERT.

CC6204 Deep Learning, Clase 09-2020: Red FF a mano en pytorch (y la versión estilo pytorch)

CC6204 Deep Learning, Clase 01-2020: Introducción, IA vs ML vs DL, ¿Por qué DL ahora?

🤖💡 ¡Descifrando los Secretos de los Transformers! 🚀📚 Explicación Completa: Attention is All You Need

Attention in transformers, step-by-step | Deep Learning Chapter 6

Computación Neuromórfica - Qué es, Historia y FUTURO de la IA

C6204 Deep Learning, Clase Adicional-2020: Más acerca de Transformers (BERT, RoBerta, XLNet, GPT)

Scott Aaronson - The TRUTH About Quantum Computing

CC6204 Deep Learning, Clase 08-2020: Entropía Cruzada y Backpropagation a mano con Tensores

Training Sand to Think: Artificial General Intelligence & Future of Physics

CC6204 Deep Learning, Clase 12-2020: Inicialización, Normalización y Batch Normalization

Funciones de activación a detalle (Redes neuronales)

Attention is all you need (Transformer) - Model explanation (including math), Inference and Training

La magia del Machine Learning: Embeddings

¿Qué es una Red Neuronal? | Aprendizaje Profundo. Capítulo 1

CC6204 Deep Learning, Class 13-2020: Optimization Algorithms, SGD with Momentum, RMSProp, Adam

CC6204 Deep Learning, Clase 10-2020: Generalización, Test-Dev-Train set, e Intro. a Regularización

CC6204 Deep Learning, Clase 15-2020: Pooling, AlexNet, VGG

Transformers, the tech behind LLMs | Deep Learning Chapter 5

Las REDES TRANSFORMER ¡EXPLICADAS!

Visualizing transformers and attention | Talk for TNG Big Tech Day '24
