High Quality, High Performance Clustering with HDBSCAN | SciPy 2016 | Leland McInnes
Data clustering is a powerful tool for data analysis. It can be particularly useful in exploratory data analysis, where it helps to summarize a dataset and give intuition about it. Despite its power, clustering is used for this task far less frequently than it could be. A plethora of clustering algorithms exist, and we will survey some of the more popular options, discussing their strengths and weaknesses, particularly with regard to exploratory data analysis. Our focus, however, is on a relatively new algorithm that appears to be the best equipped to meet the needs of exploratory data analysis: HDBSCAN* has the strengths of density-based algorithms, has a small, robust set of parameters, and with a suitable implementation can be made highly scalable to large datasets. We will discuss how the algorithm works, taking a few different perspectives, and explain the techniques used for a high-performance implementation. Finally, we'll discuss ways to extend the algorithm, drawing on ideas from topological data analysis. More info on HDBSCAN here: https://github.com/lmcinnes/hdbscan. See the complete SciPy 2016 Conference talk & tutorial playlist here: SciPy 2016: Scientific Computing with Pyth...
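As a rough illustration of the density-based idea mentioned above, here is a minimal pure-Python sketch of the mutual reachability distance, which HDBSCAN* uses in place of the raw distance between points. It is not the talk's implementation or the hdbscan library's code. The function names, the parameter name `k`, the toy points, and the convention of excluding a point from its own neighbour list are all assumptions made for this example.

```python
import math


def core_distances(points, k):
    """Distance from each point to its k-th nearest neighbour.

    Assumption for this sketch: a point is not counted as its own neighbour.
    """
    result = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        result.append(dists[k - 1])
    return result


def mutual_reachability(points, k):
    """Pairwise matrix of max(core(a), core(b), d(a, b)), with 0 on the diagonal."""
    core = core_distances(points, k)
    n = len(points)
    return [
        [
            0.0 if i == j
            else max(core[i], core[j], math.dist(points[i], points[j]))
            for j in range(n)
        ]
        for i in range(n)
    ]


# Hypothetical toy data: three nearby points and one isolated point.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (10.0, 10.0)]
core = core_distances(points, k=2)
mreach = mutual_reachability(points, k=2)

for row in mreach:
    print(["%.2f" % value for value in row])
```

Points in sparse regions have large core distances, so they are pushed further away from everything else, while distances between points in dense regions are left nearly unchanged. This brute-force version is quadratic in the number of points and is only meant to show the definition, not a scalable approach.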

