Leland McInnes, John Healy | Clustering: A Guide for the Perplexed
PyData DC 2016 Finding clusters is a powerful tool for understanding and exploring data. While the task sounds easy, it can be surprisingly difficult to do it well. Most standard clustering algorithms can, and do, provide very poor clustering results in many cases. We discuss how to do clustering correctly. Finding clusters is a powerful tool for understanding and exploring data. While the task sounds easy, it can be surprisingly difficult to it well. Most standard clustering algorithms can, and do, provide very poor clustering results in many cases. Our intuitions for what a cluster is are not as clear as we would like, and can easily be lead astray. We will attempt to find a definition of clustering that makes sense for most cases, and introduce an algorithm for finding such clusters, along with a high performance python implementation of the algorithm, building up more intuition for what clustering really means as we go. 00:00 Welcome! 00:10 Help us add time stamps or captions to this video! See the description for details. Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVi...

HDBSCAN, Fast Density Based Clustering, the How and the Why - John Healy

High Quality, High Performance Clustering with HDBSCAN | SciPy 2016 | Leland McInnes

12. Clustering

BioTuring Webinar: A Practical Guide to UMAP by its author John Healy

Why Aliens Would NEVER Invade Africa

Creator of C++: Bell Labs, Negative Overhead Abstraction, Mistakes | Bjarne Stroustrup

NestJS Full Course for Beginners in 2026 | Build a Production-Ready API

A Bluffer's Guide to Dimension Reduction - Leland McInnes

Data Analysis 7: Clustering - Computerphile

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

6. Monte Carlo Simulation

MIT Just Revealed the AI Bubble's Fatal Flaw

AlphaFold - The Most Useful Thing AI Has Ever Done

UMAP Uniform Manifold Approximation and Projection for Dimension Reduction | SciPy 2018 |

Inside Anthropic, the $965 Billion AI Juggernaut | The Circuit

Christian Hennig - Assessing the quality of a clustering

8.6 David Thompson (Part 6): Nonlinear Dimensionality Reduction: KPCA

John Healy: MAP All the Things | PyData LA 2019

Creating correct and capable classifiers - Ian Ozsvald

