Lecture 9 - Speech Recognition (ASR) [Andrew Senior]
Automatic Speech Recognition (ASR) is the task of transducing raw audio signals of spoken language into text transcriptions. This talk covers the history of ASR models, from Gaussian Mixtures to attention augmented RNNs, the basic linguistics of speech, and the various input and output representations frequently employed.
![Lecture 10 - Text to Speech (TTS) [Andrew Senior]](https://i.ytimg.com/vi/1Mb-KHQNdcM/hqdefault.jpg?sqp=-oaymwE9CNACELwBSFryq4qpAy8IARUAAAAAGAElAADIQj0AgKJDeAHwAQH4Af4JgALQBYoCDAgAEAEYZSBlKGUwDw==&rs=AOn4CLByHRMrv605AW7zhxrYPtu-Kz6q7w)
▶︎
Lecture 10 - Text to Speech (TTS) [Andrew Senior]

▶︎
Automatic Speech Recognition - An Overview
![Lecture 8 - Generating Language with Attention [Chris Dyer]](https://i.ytimg.com/vi/ah7_mfl7LD0/hqdefault.jpg?sqp=-oaymwE9CNACELwBSFryq4qpAy8IARUAAAAAGAElAADIQj0AgKJDeAHwAQH4AboHgALQBYoCDAgAEAEYfyBCKDgwDw==&rs=AOn4CLBCK5TkTIkoAqM1zNNkaPpMgUvDMA)
▶︎
Lecture 8 - Generating Language with Attention [Chris Dyer]

▶︎
Mel-Frequency Cepstral Coefficients Explained Easily

▶︎
Lecture 12: End-to-End Models for Speech Processing

▶︎
ML for Audio Study Group - Intro to Audio and ASR Deep Dive

▶︎
Visualizing transformers and attention | Talk for TNG Big Tech Day '24

▶︎
State-of-the-Art in Speech Technologies

▶︎
Stanford Seminar - Deep Speech: Scaling up end-to-end speech recognition

▶︎
A Basic Introduction to Speech Recognition (Hidden Markov Model & Neural Networks)

▶︎
CS480/680 Lecture 17: Hidden Markov Models
![Lecture 3 - Language Modelling and RNNs Part 1 [Phil Blunsom]](https://i.ytimg.com/vi/nfyE8oF23yQ/hqdefault.jpg?sqp=-oaymwE9CNACELwBSFryq4qpAy8IARUAAAAAGAElAADIQj0AgKJDeAHwAQH4AboHgALQBYoCDAgAEAEYZiBmKGYwDw==&rs=AOn4CLBXmQkVbEqLcpGMoAfxrlWxU2QkCg)
▶︎
Lecture 3 - Language Modelling and RNNs Part 1 [Phil Blunsom]

▶︎
Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

▶︎
Automatic Speech Recognition, a lecture by Kai-Fu Lee

▶︎
Training Sand to Think: Artificial General Intelligence & Future of Physics

▶︎
Lecture 1 | Natural Language Processing with Deep Learning

▶︎
CS480/680 Lecture 19: Attention and Transformer Networks

▶︎
Stanford CME295 Transformers & LLMs | Autumn 2025 | Lecture 1 - Transformer

▶︎
Stanford Seminar - Deep Learning in Speech Recognition

▶︎
